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(54) Title: CYTOCHROME P450S AND USES THEREOF 

(57) Abstract: The invention features isolated cytochrome P450 polypeptides and nucleic acid molecules, as well as expression 
vectors and transgenic plants containing these molecules. In addition, the invention features uses of such molecules in methods of 
increasing the level of resistance against a disease caused hy a plant pathogen in a transgenic plant, in methods for producing altered 
compounds, for example, hydroxylated compounds, and in methods of producing isoprenoid compounds. 
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CYTOCHROME P450s AND USES THEREOF 



Field of the Invention 



This invention relates to cytochrome P450s and uses thereof 



Background of the Invention 
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Cytochrome P450s encompass a superfamily of oxidases responsible for the 
oxidation of numerous endobiotics and thousands of xenobiotics. In addition, in plants, 
cytochrome P450s play important roles in wound healing, pest resistance, signaling, and 
anti-microbial and anti-fungal activity. 

Capsidiol is a bicyclic, dihydroxylated sesquiterpene produced by many 
Solanaceous species in response to a variety of environmental stimuli, including 
exposure to UV (Back et al., Plant Cell. Physiol. 389:899-904, 1998) and infection by 
microorganisms (Molot et ah, Physiol. Plant Pathol. 379-389, 1981; Stolle et al., 
Phytopathology 78:1 193-1 197, 1988; Keller et al., Planta. 205:467-476, 1998). It is the 
primary antibiotic or phytoalexin produced in tobacco in response to fungal elicitation, 
and it is derived from the isoprenoid pathway via its hydrocarbon precursor, 5-epi- 
aristolochene (Figure 1). Several of the biosynthetic enzymes leading up to 5-epi- 
aristolochene formation have been studied (Chappell, Annu. Rev. Plant Physiol. Plant 
Mol. Biol. 46:521-547, 1995), especially 5-epi-aristolochene synthase (EAS) (VSgeli 
and Chappell, Plant Physiol. 88:1291-1296, 1988; Back and Chappell, Proc. Natl. Acad. 
Sci. U.S.A. 93:6841-6845, 1996; Mathis et al., Biochemistry 36:8340-8348, 1997; 
Starks et al., Science 277:1815-1820, 1997). EAS commits carbon to sesquiterpene 
metabolism by catalyzing the cyclization of farnesyl diphosphate (FPP) to 5-epi- 
aristolochene. However, until the present invention, the enzyme(s) responsible for the 
conversion of 5-epi-aristolochene to capsidiol has yet to be fully identified and 
characterized. 

Biochemical evidence from previous studies in tobacco (Whitehead et al., 
Phytochemistry 28:775-779, 1989) and green pepper (Hoshino et al., Phytochemistry 
38:609-613, 1995) have suggested that the oxidation of 5-epi-aristolochene to capsidiol 
occurs in a two step process with one of the hydroxylation steps being constitutive and 
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the other being mediated by an eJicitor-inducible cytochrome P450 (Figure 1). Because 
1-deoxycapsidiol had been isolated from natural sources (Watson et al., Biochem. Soc. 
Trans. 1 1 :589, 1983), Whitehead et al. (Phytochemistry 28:775-779, 1989), surmised 
that perhaps the biosynthesis of this intermediate was due to pathogen induction of a 
5 corresponding hydroxylase. They therefore prepared synthetic 1-deoxycapsidiol and 
reported a modest conversion of this compound to capsidiol when fed to control or 
unelicited tobacco cell cultures. This was further supported by their observation that 
radiolabeled 5-epi-aristolochene was only converted to capsidiol when fed to elicitor- 
induced cell cultures but not control cultures. Whitehead et al. (Phytochemistry 28:775- 

10 779, 1989) therefore concluded that the 3-hydroxylase, responsible for hydroxylation of 
5-epi-aristolochene at C3 to generate 1-deoxycapsidiol, was pathogen/elicitor inducible, 
while the 1 -hydroxy lase, responsible for hydroxylating 1-deoxycapsidiol at the CI to 
generate capsidiol, was constitutive. Hoshino et al. (Phytochemistry 38:609-613, 1995) 
added to the observations of Whitehead et al. (Phytochemistry 28:775-779, 1989) by 

15 directly measuring 3-hydroxylase-activity in microsomal preparations of arachidonic 
acid-elicited Capsicum annuum fruits and seedlings. These assays consisted of 
incubating 5-epi-aristolochene with microsome preparations and subsequently 
determining the amount of 1 -deoxycapsidiol generated by a combination of thin-layer 
chromatography (TLC) separations and gas chromatography (GC). Their evidence 

20 demonstrated that the conversion of 5-epi-aristolochene to 1-deoxycapsidiol was 

dependent on both NADPH and O2, and that 1-deoxycapsidiol accumulation in vitro was 
arrested by the P450 antagonists carbon monoxide (Omura and Sato, J. Biol. Chem. 
239:2370-2378, 1964), ancymidol (Coolbaugh et al., Plant Physiol. 62:571-576, 1978), 
and ketoconazole (Rademacher, Annu. Rev. Plant Physiol. Plant Mol. Biol. 51:501-531, 

25 2000). 

Recent results suggest that the hydroxylation of 5-epi-aristolochene is an 
important regulated step in capsidiol biosynthesis. In studies to evaluate the 
effectiveness of methyl-jasmonate as an inducer of capsidiol biosynthesis in tobacco cell 
cultures, Mandujano-Chavez et al. (Arch. Biochem. Biophys. 381 :285-294, 2000), 
30 reported that the modest accumulation of this phytoalexin was accompanied by a strong 
induction of EAS. This result implied that steps before or after the sesquiterpene cyclase 
reaction were limiting. Using an in vivo assay measuring the conversion rate of 
radiolabeled 5-epi-aristolochene to capsidiol, a very limited induction of the hydroxylase 
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activities was observed in cells treated with methyl jasmonate relative to that in fungal 
elicitor-treated cells. This result pointed to the hydroxylase reactions as a potentially 
limiting step in capsidiol biosynthesis. 

5 Summary of the Invention 

In one aspect, the invention features several isolated cytochrome P450 
polypeptides (such as CYP71D20, CYP71D21, CYP73A27, CYP73A28, and CYP92A5, 
and P450s having substantial identity to these polypeptides), as well as isolated nucleic 
acid molecules that encode these P450s. 
10 In related aspects, the invention features a vector (such as an expression vector) 

including an isolated nucleic acid molecule of the invention and a cell (for example, a 
prokaryotic cell, such as Agrobacterium or E. coli> or a eukaryotic cell, such as a 
mammalian, insect, yeast, or plant cell) including the isolated nucleic acid molecule or 
vector. 

1 5 In yet another aspect, the invention features a transgenic plant or transgenic plant 

component including a nucleic acid molecule of the invention, wherein the nucleic acid 
molecule is expressed in the transgenic plant or the transgenic plant component. 
Preferably, the transgenic plant or transgenic plant component is an angiosperm (for 
example, a monocot or dicot). In preferred embodiments, the transgenic plant or 

20 transgenic plant component is a solanaceous, maize, rice, or cruciferous plant or a 

component thereof. The invention further includes a Seed produced by the transgenic 
plant or transgenic plant component, or progeny thereof. 

In another aspect, the invention features a method of providing an increased level 
of resistance against a disease caused by a plant pathogen in a transgenic plant. The 

25 method involves: (a) producing a transgenic plant cell including the nucleic acid 

molecule of the invention integrated into the genome of the transgenic plant cell and 
positioned for expression in the plant cell; and (b) growing a transgenic plant from the 
plant cell wherein the nucleic acid molecule is expressed in the transgenic plant and the 
transgenic plant is thereby provided with an increased level of resistance against a 

30 disease caused by a plant pathogen. 

In another aspect, the invention features a method for producing an altered 
compound, the method including the steps of contacting the compound with one or more 
of the isolated polypeptides disclosed herein under conditions allowing for the 
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hydroxy lation, oxidation, demethylation, or methylation of the compound and 
recovering the altered compound. 

In still another aspect, the invention features a hydroxylating agent including any 
of the isolated polypeptides disclosed herein. 
5 In yet another embodiment, the invention features an isolated nucleic acid 

molecule that specifically hybridizes under highly stringent conditions to the 
complement of any one of the sequences described in SEQ ID NO:2 (CYP71D20), SEQ 
ID NO:4 (CYP71D21), SEQ ID NO:6 (CYP73A27), SEQ ID NO:8 (CYP73A28), or 
SEQ ID NO: 12 (CYP92A5), wherein such a nucleic acid molecule encodes a 
10 cytochrome P450 polypeptide. 

In another aspect, the invention features a host cell expressing a recombinant 
isoprenoid synthase and a recombinant cytochrome P450. In preferred embodiments, 
the host cell further expresses, independently or in combination, a recombinant 
acetylrxansferase, methyltransferase, or fatty acyltransferase. In other preferred 
15 embodiments, the host expresses an endogenous or recombinant cytochrome reductase. 
Preferably, the host cell is a yeast cell, a bacterial cell, an insect cell, or a plant cell. 

In a related aspect, the invention features a method for producing an isoprenoid 
compound, the method including the steps of: (a) culturing a cell that expresses a 
recombinant isoprenoid synthase and a recombinant cytochrome P450 under conditions 
20 wherein the isoprenoid synthase and the cytochrome P450 are expressed and catalyze 
the formation of an isoprenoid compound not normally produced by the cell; and (b) 
recovering the isoprenoid compound. In preferred embodiments, the host cell further 
expresses a recombinant acetyltransferase, a recombinant methyltransferase, or a 
recombinant fatty acyltransferase. In other preferred embodiments, the host cell 
25 expresses an endogenous or recombinant cytochrome reductase. Preferably, the host cell 
is a yeast cell, a bacterial cell, an insect cell, or a plant cell. 

In yet another aspect, the invention features an isoprenoid compound produced 
according to the above-mentioned methods. 

By " P450 polypeptide," "cytochrome P450," or "P450" is meant a polypeptide 
30 that contains a heme-binding domain and shows a CO absorption spectra peak at 450 nm 
according to standard methods, for example, those described herein. Such P450s may 
also include, without limitation, hydroxylase activity, dual hydroxylase activity, 
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demethylase activity, or oxidase activity. Such enzymatic activities are determined 
using methods well known in the art. 

By "polypeptide" is meant any chain of amino acids, regardless of length or post- 
translational modification (for example, glycosylation or phosphorylation). 
5 By "substantially identical" is meant a polypeptide or nucleic acid exhibiting at 

least 80 or 85%, preferably 90%, more preferably 95%, and most preferably 97%, or 
even 98% identity to a reference amino acid sequence (for example, the amino acid 
sequence shown in SEQ ID NOS: 1, 3, 5, 7, and 1 1) or nucleic acid sequence (for 
example, the nucleic acid sequences shown in SEQ ID NOS:2, 4, 6, 8, and 12, 

10 respectively). For polypeptides, the length of comparison sequences will generally be at 
least 16 amino acids, preferably at least 20 amino acids, more preferably at least 25 
amino acids, and most preferably 35 amino acids. For nucleic acids, the length of 
comparison sequences will generally be at least 50 nucleotides, preferably at least 60 
nucleotides, more preferably at least 75 nucleotides, and most preferably 110 

1 5 nucleotides. 

Sequence identity is typically measured using sequence analysis software (for 
example, Sequence Analysis Software Package of the Genetics Computer Group, 
University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 
53705, BLAST, or PILEUP/PRETTYBOX programs). Such software matches identical 

20 or similar sequences by assigning degrees of homology to various substitutions, 

deletions, and/or other modifications. Conservative substitutions typically include 
substitutions within the following groups: glycine alanine; valine, isoleucine, leucine; 
aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, arginine; 
and phenylalanine, tyrosine. 

25 By an "isolated polypeptide" is meant a P450 polypeptide (for example, a 

CYP71D20 (SEQ ID NO: 1), CYP71D21 (SEQ ID NO:3), CYP73A27 (SEQ ID NO:5), 
CYP73A28 (SEQ ID NO:7), or CYP92A5 (SEQ ID NO:l 1) polypeptide) that has been 
separated from components that naturally accompany it. Typically, the polypeptide is 
isolated when it is at least 60%, by weight, free from the proteins and naturally- 

30 occurring organic molecules with which it is naturally associated. Preferably, the 

preparation is at least 75%, more preferably at least 90%, and most preferably at least 
99%, by weight, a P450 polypeptide. An isolated P450 polypeptide may be obtained, 
for example, by extraction from a natural source (for example, a plant cell); by 
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expression of a recombinant nucleic acid encoding a P450 polypeptide; or by chemically 
synthesizing the protein. Purity can be measured by any appropriate method, for 
example, column chromatography, polyacrylamide gel electrophoresis, or by HPLC 
analysis. 

5 By "derived from" or "obtained from" is meant isolated from or having the 

sequence of a naturally-occurring sequence (e.g., cDNA, genomic DNA, synthetic, or 
combination thereof). 

By "isolated nucleic acid molecule" is meant a nucleic acid molecule, e.g., a 
DNA molecule, that is free of the nucleic acid sequence(s) which, in the naturally- 

10 occurring genome of the organism from which the nucleic acid molecule of the 

invention is derived, flank the nucleic acid molecule. The term therefore includes, for 
example, a recombinant DNA that is incorporated into a vector; into an autonomously 
replicating plasmid or virus; or into the genomic DNA of a prokaryote or eukaryote; or 
that exists as a separate molecule (for example, a cDNA or a genomic or cDNA 

1 5 fragment produced by PCR or restriction endonuclease digestion) independent of other 
sequences. The term "isolated nucleic acid molecule" also includes a recombinant DNA 
which is part of a hybrid gene encoding additional polypeptide sequence. 

By "specifically hybridizes" is meant that a nucleic acid sequence is capable of 
hybridizing to a DNA sequence at least under low stringency conditions, and preferably 

20 under high stringency conditions. For example, high stringency conditions may include 
hybridization at approximately 42°C in about 50% formamide, 0.1 mg/ml sheared 
salmon sperm DNA, 1% SDS, 2X SSC, 10% Dextran sulfate, a first wash at 
approximately 65°C in about 2X SSC, 1% SDS, followed by a second wash at 
approximately 65°C in about 0.1X SSC. Alternatively high stringency conditions may 

25 include hybridization at approximately 42°C in about 50% formamide, 0.1 mg/ml 
sheared salmon sperm DNA, 0.5% SDS, 5X SSPE, IX Denhardt's, followed by two 
washes at room temperature in 2X SSC, 0.1% SDS, and two washes at between 55-60°C 
in 0.2X SSC, 0.1% SDS. Reducing the stringency of the hybridization conditions may 
involve lowering the wash temperature and/or washing at a higher concentration of salt. 

30 For example, low stringency conditions may include washing in 2X SSC, 0.1% SDS at 
40°C. 
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By "transformed cell" is meant a cell into which (or into an ancestor of which) 
has been introduced, by means of recombinant DNA techniques, a DNA molecule 
encoding (as used herein) a P450 polypeptide. 

By "positioned for expression" is meant that the DNA molecule is positioned 
5 adjacent to a DNA sequence which directs transcription and translation of the sequence 
(i.e., facilitates the production of, for example, a P450 polypeptide, a recombinant 
protein, or an RNA molecule). 

By "reporter gene" is meant a gene whose expression may be assayed; such 
genes include, without limitation, beta-glucuronidase (GUS), luciferase, 
10 chloramphenicol transacetylase (CAT), green fluorescent protein (GFP), beta- 
galactosidase, herbicide resistant genes, and antibiotic resistance genes. 

By "expression control region" is meant any minimal sequence sufficient to 
direct transcription. Included in the invention are promoter elements that are sufficient 
to render promoter-dependent gene expression controllable for cell-, tissue-, or organ- 
15 specific gene expression, or elements that are inducible by external signals or agents (for 
example, light-, pathogen-, wound-, stress-, or hormone-inducible elements or chemical 
inducers such as salicylic acid (SA) or 2,2-dichloro isonicotinic acid (INA)); such 
elements may be located in the 5' or 3 f regions of the native gene or engineered into a 
transgene construct. 

20 By "operably linked" is meant that a gene and a regulatory sequence(s) are 

connected in such a way as to permit gene expression when the appropriate molecules 
(for example, transcriptional activator proteins) are bound to the regulatory sequence(s). 

By "plant cell" is meant any self-propagating cell bounded by a semi-permeable 
membrane and typically is one containing a plastid. Such a cell also requires a cell wall 
25 if further propagation is desired. Plant cell, as used herein includes, without limitation, 
algae, cyanobacteria, seeds, suspension cultures, embryos, meristematic regions, callus 
tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. 

By "plant component" is meant a part, segment, or organ obtained from an intact 
plant or plant cell. Exemplary plant components include, without limitation, somatic 
30 embryos, leaves, stems, roots, flowers, tendrils, fruits, scions, and rootstocks. 

By "transgene" is meant any piece of DNA which is inserted by artifice into a 
cell and typically becomes part of the genome, for example, the nuclear or plastidic 
genome, of the organism which develops from that cell. Such a transgene may include a 
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gene which is partly or entirely heterologous (i.e., foreign) to the transgenic organism, or 
may represent a gene homologous to an endogenous gene of the organism. 

By "transgenic" is meant any cell which includes a DNA sequence which is 
inserted by artifice into a cell and becomes part of the genome of the organism which 
5 develops from that cell. As used herein, the transgenic organisms are generally 
transgenic plants and the DNA (transgene) is inserted by artifice into the nuclear or 
plastidic genome. A transgenic plant according to the invention may contain one or 
more engineered traits. 

By "pathogen" is meant an organism whose infection of viable plant tissue elicits 

10 a disease response in the plant tissue. Such pathogens include, without limitation, 

bacteria, mycoplasmas, fungi, insects, nematodes, viruses, and viroids. Plant diseases 
caused by these pathogens are described in Chapters 1 1-16 of Agrios, Plant Pathology, 
3rd ed., Academic Press, Inc., New York, 1988. 

By "increased level of resistance" is meant a greater level of resistance to a 

15 disease-causing pathogen in a transgenic plant (or cell or seed thereof) of the invention 
than the level of resistance relative to a control plant (for example, a non-transgenic 
plant). In preferred embodiments, the level of resistance in a transgenic plant of the 
invention is at least 20% (and preferably 30% or 40%) greater than the resistance of a 
control plant. In other preferred embodiments, the level of resistance to a disease- 

20 causing pathogen is 50% greater, 60% greater, and more preferably even 75% or 90% 
greater than a control plant; with up to 100% above the level of resistance as compared 
to a control plant being most preferred. The level of resistance is measured using 
conventional methods. For example, the level of resistance to a pathogen may be 
determined by comparing physical features and characteristics (for example, plant height 

25 and weight, or by comparing disease symptoms, for example, delayed lesion 

development, reduced lesion size, leaf wilting and curling, water-soaked spots, and 
discoloration of cells) of transgenic plants. 

By "purified antibody" is meant antibody which is at least 60%, by weight, free 
from proteins and naturally-occurring organic molecules with which it is naturally 

30 associated. Preferably, the preparation is at least 75%, more preferably 90%, and most 
preferably at least 99%, by weight, antibody, for example, an acquired resistance 
polypeptide-specific antibody. A purified P450 antibody may be obtained, for example, 
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by affinity chromatography using a recombinantly-produced P450 polypeptide and 
standard techniques. 

By "specifically binds" is meant an antibody which recognizes and binds a P450 
protein but which does not substantially recognize and bind other molecules in a sample, 
5 for example, a biological sample, which naturally includes a P450 protein such as 
CYP71D20, CYP71D21, CYP73A27, CYP73A28, or CYP92A5. 

Other features and advantages of the invention will be apparent from the 
following description of the preferred embodiments thereof, and from the claims. 

10 Brief Description of the Drawings 

Figure 1 is a schematic diagram of a proposed alternative pathway for the 
biosynthesis of capsidiol in elicitor-treated Nicotiana tabacum cells. 5-epi-aristolochene 
is synthesized from FPP by the action of a sequiterpene cyclase, 5-epi-aristolochene 
synthase (EAS), and is subsequently hydroxylated at CI and C3 to form capsidiol. 

15 Figure 2 is a graph showing an induction time course for sesquiterpene cyclase 

enzyme activity and sesquiterpene hydroxylase activity in cellulase-treated cell cultures. 
Sesquiterpene cyclase (5-epi-aristolochene synthase, EAS) enzyme activity was 
determined in extracts prepared from control (open squares) and elicitor-treated (closed 
squares) cells collected at the indicated time points. Sesquiterpene hydroxylase activity 

20 was determined using an indirect assay for control (open circles) and elicitor-treated 
(closed circles) cells. Cell cultures were incubated with [ 3 H]- 5-epi-aristolochene for 3 
hours ending at the indicated time points before quantifying the incorporation of 
radioactivity into extracellular capsidiol, a dihydroxylated form of aristolochene 
(Mandujano-Chavez et al., Arch. Biochem. Biophys. 381:285-294, 2000). 

25 Figure 3 is a series of graphs showing the dose dependent inhibition of 5-epi- 

aristolochene hydroxylase activity by ancymidol and ketoconazole. Cell cultures were 
incubated in the presence of ceilulase (0.5 jig/mL) plus the indicated concentrations of 
ancymidol (A) or ketoconazole (B) for 12 hours prior to measuring the in vivo 5-epi- 
aristolochene hydroxylase activity in the cell suspension cultures (squares), or the EAS 

30 enzyme activity in extracts prepared from the collected cells (triangles). The in vitro 
activity of a purified EAS preparation (Back and Chappell, J. Biol. Chem. 270:7375- 
7381 , 1995) was also measured at the indicated inhibitor concentrations as an additional 
test for non-specific effects of these inhibitors (circles). 
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Figure 4A is a schematic diagram of the primary structure of a generalized 
cytochrome P450 with conserved domains used for the design of PCR primers 
highlighted (SEQ ID NOS:26-29). 

Figure 4B is a list of the degenerate P450-specific primers (SEQ ID NOS:30-36) 
5 that were used in various combinations with vector specfic primers in the amplification 
of cytochrome P450 cDNA fragments. 

Figure 4C is a scanned image of an ethidium bromide-stained agarose gel 
showing the PCR products amplified from a directional cDNA library prepared with 
mRNA isolated from elicitor-treated cells using the degenerate primer GRRXCP(A/G)- 
10 for (SEQ ID NO:35) and the T7 vector-specific primer (SEQ ID NO:37). The T3 
vector-specific primer is also shown (SEQ ID NO:38). 

Figure 5 is a series of Northern blots showing the induction time course for 
CYP71D, CYP73A, CYP82E, CYP92A, and EAS transcript accumulation in elicitor 
treated cells. Total RNA was extracted from tobacco suspension cells incubated with 
1 5 the cellulase elicitor for the indicated durations, size fractionated by agarose gel 

electrophoresis under denaturing conditions, and transferred to a nylon membrane before 
probing with the respective full-length cDNAs. The uniformity of sample loading was 
verified by ethidium bromide staining of ribosomal RNA (Loading control). 

Figure 6 is a series of graphs showing carbon monoxide (CO) difference spectra 
20 of the microsomal fraction isolated from yeast expressing the CYP92 A5 (A) and 

CYP71D20 (B) cDNAs. Expression of the respective plasmid constructs engineered 
into the yeast (WAT1 1) cells was induced by a galactose treatment, followed by 
isolation of microsomal preparations. The difference adsorption spectra of microsomes 
incubated in the presence (solid lines) and absence (broken lines) of carbon monoxide 
25 was determined. 

Figure 7 is a series of gas chromatograms of the reaction products formed upon 
incubation of microsomes isolated from WAT1 1 yeast cells containing the CYP71D20 
expression construct (A and C) or vector control DNA (B and D) with sesquiterpene 
substrates. Microsomes isolated from the indicated yeast lines were incubated with 5- 
30 epi-aristolochene (A and B) or 1-deoxycapsidiol (C and D) in the presence (solid lines) 
or absence (dashed lines) of NADPH. The identities of 5-epi-aristolochene, 1- 
deoxycapsidiol, and capsidiol were verified by mass spectrometry. 
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Figure 8 is a sequence comparison of the amino acid sequence of Nicotiana 
tabacum 5-epi-aristolochene (sesquiterpene) hydroxylase NtCYP71D20 (SEQ ID NO:l) 
with other plant terpene hydroxylases (SEQ ID NOS:39-43). NrCYP71 A5vl (GenBank 
accession number CAA70575) catalyzes the mono-hydroxylation of nerol and geraniol, 
5 linear monoterpenes, while PaCYP71 Al (A35867) catalyzes the epoxidation of these 
substrates (Hallahan et al., Biochim. Biophys. Acta. 1201:94-100, 1994). MsCYP71D18 
(AAD44150) and MpCYP71D13 (AAD44151) catalyze the mono-hydroxylation at C6 
and C3 of limonene, a cyclic monoterpene, respectively (Lupien et al., Arch. Biochem. 
Biophys. 368:181-192, 1999). AtCYP701 A3 (AAC39505) encodes for kaurene oxidase, 

10 which catalyzes a 3-step reaction including a hydroxylation followed by oxidation of a 
diterpene (Helliwell et al., Plant Physiol. 1 19:507-510, 1999). Shown are sequences 
from Mentha piperita (MpCYP71D13; SEQ ID NO:39), Mentha spicata 
(MsCYP71D18; SEQ ID NO:40), Nepeta racemosa (NrCYP71A5vl; SEQ ID NO:41), 
Nicotiana tabacum (NtCYP71D20; SEQ ID NO:l), Persea americana (PaCYP71 Al; 

1 5 SEQ ID NO:42), and Arabidopsis thaliana (CYP701 A3; SEQ ID NO:43). Conserved 
residues are shaded. 

Detailed Description 
Capsidiol is a bicyclic, dihydroxylated sesquiterpene produced by several 

20 Solanaceous species in response to a variety of environmental stimuli. It is the primary 
antimicrobial compound produced by Nicotiana tabacum in response to fungal 
elicitation, and it is formed via the isoprenoid pathway from 5-epi-aristolochene. Much 
of the biosynthetic pathway for the formation of this compound has been elucidated, 
except for the enzyme(s) responsible for the conversion of the allylic sesquiterpene 5- 

25 epi-aristolochene to its dihydroxylated form, capsidiol. 

Accordingly, an in vivo assay for 5-epi-aristolochene hydroxylase-activity was 
developed and used to demonstrate a dose dependent inhibition of activity by ancymidol 
and ketoconazole, two well-characterized inhibitors of cytochrome P450 enzymes. 
Using degenerate oligonucleotide primers designed to the well-conserved domains found 

30 within most P450 enzymes, including the heme binding domain, cDNA fragments 
representing four distinct P450 families (CYP71, CYP73, CYP82, and CYP92) were 
amplified from a cDNA library prepared against mRNA from elicitor-treated cells using 
PCR. The PCR fragments were subsequently used to isolate full-length cDNAs 
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(CYP71D20 (SEQ ID NO: 2) and D21 (SEQ ID NO: 4), CYP73A27 (SEQ ID NO: 6) 
and A28 (SEQ ID NO: 8), CYP82E1 (SEQ ID NO: 10), and CYP92A5 (SEQ ID NO: 
12)), and these in turn were used to demonstrate that the corresponding mRNAs were all 
induced in elicitor-treated cells, albeit with different induction patterns. 
5 There now follows a description of the cloning of several P450s from Nicotiana 

tabacum. These examples are provided for the purpose of illustrating the invention, and 
are not to be considered as limiting. 

Inhibition of the 5-epi-aristolochene to capsidiol conversion by P450 antagonists 

10 Using an indirect assay, a detailed induction time course of 5EAH activity in 

elicitor-induced cell cultures was determined relative to that of EAS activity (Figure 2), 
the we 11 -characterized sesquiterpene cyclase activity that catalyzes the formation of 5- 
epi-aristolochene from FPP (Figure 1). Using assays for EAS and 5EAH, EAS activity 
is not detectable in control cell cultures, but is induced significantly within 3 hours and 

15 reaches its maximal level within 15 to 18 hours of elicitor-treatment. Similar to the EAS 
enzyme activity, 5EAH activity was negligible in control cell cultures. Nonetheless, 
after an apparent lag phase of 8 hours, a rapid induction of hydroxylase activity was 
observed 10 to 15 hours post elicitor addition to the cell cultures, reaching a maximum 
by 18 hours followed by a rather gradual decline of 10 to 20% over the next 8 hours. 

20 Tobacco cell suspension cultures treated with cellulase plus varying 

concentrations of ancymidol or ketoconazole were pre-incubated for 12 hours before 
measuring the cells' ability to convert exogenous supplied [ 3 H] labeled 5-epi- 
aristolochene to radiolabeled capsidiol during a subsequent 3 hour incubation period 
(Figure 3). Apparent activity of 5EAH was inhibited in a dose-dependent manner with 

25 approximately 50% inhibition by either 25 \iM ancymidol or ketoconazole, and more 
than 80% by 75 jiM ancymidol and 95% by 100 uM ketoconazole (Figure 3 A and B). 
Importantly, neither the in vitro activity of recombinant EAS nor the induction of EAS 
in the elicitor-treated cell cultures was significantly affected by ancymidol at 
concentrations as high as 100 uM (Figure 3 A). Ketoconazole also does not appear to 

30 affect the in vitro activity of EAS. However, the inducibility of cyclase activity in 

elicitor-treated cell extracts was inhibited by ketoconazole at concentrations above 50 
uM (Figure 3B). Therefore, the specificity of ketoconazole as an inhibitor of P450 type 
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reactions should be assessed at or below a concentration of 50 uM under these 
experimental conditions. 

Isolation of elicitor-inducible cytochrome P450 cDNAs 
5 A two-step approach for the isolation of candidate P450 cDNAs was followed. 

A PCR strategy was first employed using a directional cDNA library prepared against 
mRNA isolated from elicitor-induced cells as the template and degenerate PCR primers 
(Figure 4). Sequence alignments of cytochrome P450s from multiple families across 
kingdoms were used to identify conserved regions to which a series of degenerate 

10 primers were prepared (Figure 4 A and B). In cloning experiments, 450 to 550 bp 

products were expected from reactions utilizing the primer prepared to the heme-binding 
domain (GRRXCP(A/G)) (SEQ ID NOS:27 and 28) and the T7 vector primer (Figure 
4C). The mixtures of reaction products were shotgun cloned, and approximately 100 of 
the cloned PCR fragments were sequenced. About half of the sequenced DN As 

15 contained signature sequences typical of P450 enzymes as revealed by BlastX database 
searches, and these corresponded to typical plant P450 family members of the CYP71, 
CYP73, CYP92, and CYP82 classes. Each of these PCR fragments was isolated 
multiple times in separate experiments. In addition, we isolated full-length cDNAs for 
these P450 family members. Table 1 compares the similarity and identity of the full- 

20 length cDNAs of P450 family members with those of their nearest family member in the 
GenBank database. In addition, Figure 8 shows an amino acid alignment of several 
terpene cytochrome P450s. Alignments were performed using the algorithm of the 
MACVECTOR software suite. 
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Table 1: Full-length cDNAs cloned from an elicited cDNA library 



Cytochrome P450 
cDNA clone 


Nearest relative/ 
accession number 


% identity 


% similarity 


CYP71D20 


CYP71D7 (S. chacoense) 
Gen EMBL U48435 


76.5 


88.8 


CYP71D21 


CYP71D7 (S. chacoense) 
Gen EMBL U48435 


76.3 


88.8 


CYP73A27 


CYP73A1 5 (P. vulgaris) 
Gen EMBL Y09447 


79.4 


92.6 


CYP73A28 


CYP73A1 5 (P. vulgaris) 
Gen EMBL Y09447 


79.2 


92.4 


! CYP82E1 


CYP82E1 (/V. tabacum) 
Gen EMBL AB01 5762 


100.0 


100.0 I 


CYP92A5 


CYP92A3 IN. tabacum) 
Gen EMBL X96784 


95.5 


98.6 



The cloned fragments were used in a second step to isolate full-length clones from 
the cDNA library. Screening the cDNA library by hybridization with the CYP7 1 and 
5 CYP73 gene fragments yielded four full-length cDNAs, two CYP71Ds and two 

CYP73As. The former clones were designated CYP71D20 and CYP71D21 , and the 
latter were designated CYP73A27 and CYP73A28. The other two cDNA fragments 
corresponded to tobacco cDNAs already found in the GenBank database, CYP82E1 and 
CYP92A3. These two cDNAs were cloned using specific primers designed with the 
10 help of the available sequence information to amplify the full-length cDNA 

Induction of cytochrome P45Q mRNAs in el ici tor- treated cells 

To correlate a biochemical role for P450s in sesquiterpene metabolism, RNA 
blot analyses were used to determine the steady-state levels of the mRNAs coding for all 
15 four of the cytochrome P450 clones and EAS in control and elicitor-treated cells (Figure 
5). The mRNAs for all four of the P450s were rapidly and transiently induced with 
slightly different time courses relative to one another and to the EAS mRNA. 
CYP73A27 mRNA, for instance, displayed an induction pattern similar to that of EAS 
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with the maximum mRNA level occurring 9 to 12 hours after elicitation. While the EAS 
mRNA remained high throughout the duration of the experiment, the CYP73A27 
mRNA was negligible in cells 24 hours after elicitor-treatment In contrast, the 
CYP71D mRNA was more rapidly induced than the EAS mRNA, reached its maximum 
5 6 to 9 hours after elicitation, and was declining by 1 2 hours when the EAS mRNA level 
was still very high. 

Functional identification of CYP71D20 as 5-epi-aristoIochene hydroxylase 

To ascribe functional identity to the various P450 cDNAs, full-length cDNAs for 

10 CYP71D20, CYP82E1 and CYP92A5 were inserted into the yeast expression vector 

pYeDP60 (Urban et al., Biochimie 72:463-472, 1990; Pompon et al., Methods Enzymol. 
272:5 1 -64, 1 996) and the expression of each in WAT1 1 , a yeast line containing an 
integrated Arabidopsis thaliana cytochrome reductase gene (Pompon et al., Methods 
Enzymol. 272:51-64, 1996; Urban et al., J. Biol. Chem. 272:19176-19186, 1997), was 

1 5 determined. Engineering the CYP73 A27 cDNA required an extra modification because 
of an unusually long N-terminus with several hydrophilic residues that may interfere 
with proper intracellular targeting (Nedelkina et al., Plant Mol. Biol. 39:1079-1090, 
1999). This unusual leader sequence therefore was replaced with the membrane 
anchoring sequence of CYP73A1, a cinnamate 4-hydroxylase previously demonstrated 

20 to express well in yeast (Fahrendorf and Dixon, Arch. Biochem. Biophys. 305:509-515, 
1993; Pompon et al., Methods Enzymol. 272:51-64, 1996). Expression of all these 
cDNAs was under the control of the glucose-repressible, galactose-inducible GAL 10- 
CYC1 promoter (Guarente et al., Proc. Natl. Acad. Sci. U.S.A. 79:7410-7414, 1982), 
and expression was compared to yeast transformed with the parent pYeDP60 vector 

25 (control) alone. 

After induction with galactose for approximately 1 6 hours, control cells and cells 
containing the various P450 constructs were collected, and microsomes prepared from 
each were analyzed for general P450 expression by CO-difference spectroscopy (Omura 
and Sato, J. Biol. Chem. 239:2370-2378, 1964). Microsomes prepared from cells 

30 containing the CYP71D20 (Figure 6A) and CYP92A5 (Figure 6B) constructs both 

showed characteristic CO difference spectra with peaks at 450 nm, indicating that the 
encoded proteins were assembling properly with their heme cofactor. Using the 
extinction coefficient of 91 mM'^cm' 1 for heme binding proteins (Omura and Sato, J. 
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Biol. Chem. 239:2370-2378, 1964), it was determined that approximately 107 pmol of 
CYP71D20 and 268 pmol of CYP92A5 were expressed in the yeast cells per milligram 
of total yeast protein. 

Both 5-epi-aristolochene and 1 -deoxycapsidiol were metabolized to only one 
5 product with the same retention time as capsidiol. Obvious by its absence, no reaction 
product having a retention time similar to deoxycapsidiol was detectable in the 5-epi- 
aristolochene incubations (Figure 7). Co-injection of authentic capsidiol with the 
respective reaction products resulted in a single GC peak having a 16.2 minute retention 
time, identical to capsidiol. Mass spectra patterns for the separate reaction products 

10 were identical to that for the capsidiol standard (EIMS m/z 236, 221, 203, 185, 175, 163, 
157, 133, 121, 107, 93, 79, 67, 55, 43, 41). 

The in vivo assay data presented in Figures 2 and 3 of the current work indicate 
that the conversion of 5-epi-aristolochene is catalyzed by at least one inducible 
cytochrome P450 mediated reaction. 

15 Furthermore, any of the cytochrome p450 polypeptides described herein may 

include one or more hydroxylase activities which can incorporate hydroxyl groups into 
at least two distant sites on an isoprenoid compound. The addition of these hydroxyl 
groups may occur, for example, sequentially, by adding a hydroxyl group first to one site 
and then the other, in either order. Moreover, such hydroxylases may be mutated to 

20 limit their ability to hydroxylate a substrate at only one site, or, alternatively, to provide 
stereochemical specificity to their hydroxylating activity. 

The above-described experiments were performed using the following materials 
and methods. 



25 Chemicals 

Standard laboratory reagents were purchased from Becton Dickinson 
Microbiology Systems (Sparks, MD), FisherBiotech (Fair Lawn, NJ) and Sigma 
Chemical Company (St. Louis, MO). 

30 Biological materials and induction treatments 

Nicotiana tabacum cv. KYI 4 plants and cell suspension cultures were used. Cell 
suspension cultures were maintained in modified Murashige-Skoog (Vogeli and 
Chappell, Plant Physiol. 88:1291-1296, 1988). Cultures in their rapid phase of growth 
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(3 days old) were used for all experiments. At the indicated times, cells were collected 
and separated from media by vacuum filtration and stored at -80°C. 

Induction treatments were performed by the addition of the fungal elicitors, 
cellulase {Trichoderma viride, Type RS, Onozuka) or paraciticein (O'Donohue et al., 
5 Plant Mol. Biol. 27:577-586, 1995) at the indicated concentrations. Paraciticein was 
purified from E. coli cells overexpressing a recombinant paraciticein protein containing 
a carboxy-terminal histidine purification tag. 

In vivo 5-epi-aristolochene hydroxylase assay and inhibition studies 

1 0 5-epi-aristolochene hydroxylase-activity was measured as the incorporation of 

[ 3 H]-5-epi-aristolochene into extracellular capsidiol by intact cells. [ 3 H]-5-epi- 
aristolochene was produced by incubating an excess of [1- 3 H] famesyl diphosphate (1 
HM, 20.5 Ci/mmol) with recombinant 5-epi-aristolochene synthase (Back et al., Arch. 
Biochem. Biophys. 315:527-532, 1994; Rising et al., J. Am. Chem. Soc. 122:1861-1866, 

15 2000). The hexane extractable radioactivity from reactions was treated with a small 

amount of silica to remove any farnesol or residual FPP before quantifying the yield of 
radioactive 5-epi-aristolochene by liquid scintillation counting. The hexane solvent was 
removed under a gentle stream of N 2 gas, and the dried residue was re-dissolved in 
acetone. Control and elici tor-treated cells were then incubated with [ 3 H]-5-epi- 

20 aristolochene (approximately 100,000 dpm at 2.5 nM) for 3 hour periods at various 
points during an induction time course before collecting the cell and media samples. 
Detection and quantification of capsidiol in the extracellular culture media was 
performed as reported previously (Chappell et al., Phytochemistry 26:2259-2260, 1987), 
and the amount of radioactivity incorporated into capsidiol was determined. For these 

25 determinations, samples were separated by TLC, and the zones corresponding to 
capsidiol were scraped from the plate for scintillation counting. 

Inhibition studies were performed by the addition of the P450 inhibitors 
ancymidol (Coolbaugh et al., Plant Physiol. 62:571-576, 1978; Hoshino et al., 
Phytochemistry 38:609-613, 1995) and ketoconazole (Hoshino et al., Phytochemistry 

30 38:609-613, 1995; Rademacher, Annu. Rev. Plant Physiol. Plant Mol. Biol. 51:501-531, 
2000) directly to the cell cultures or enzyme assay mix. Cell cultures were incubated in 
the presence of cellulase (0.5 Hg/ml) and indicated concentrations of ancymidol or 
ketoconazole for 12 hours prior to the addition of [ 3 H]-5-epi-aristoIochene. After a 
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further 3 hour incubation period, the cells and media were collected. The amount of 
radioactivity incorporated into extracellular capsidiol was determined as described 
above. To evaluate secondary effects of these inhibitors, the level of inducible 
sesquiterpene cyclase activity in the collected cells was determined according to (Vogeli 
5 et al., Plant Physiol. 93:1 82-1 87, 1990), as well as in vitro assays with purified 

recombinant EAS (Back et al., Arch. Biochem. Biophys. 31 5:527-532, 1994) incubated 
with the indicated concentrations of ancymidol and ketoconazole. 

All experiments were replicated in several independent trials. While the absolute 
values presented may have varied between experiments by as much as 50%, the trends 
10 and time courses were consistent throughout. 

Construction of an elicitor-induced cDNA library 

Cell cultures were incubated with fungal elicitor (0.5 ug cellulase/ml) for 6 hours 
before collecting the cells by filtration. The cells were kept frozen at -80°C until total 

1 5 RNA was extracted from them using Trizol (Life Technologies, Rockville, MD) 
according to the manufacturer's instructions. Poly.(A) + RNA was purified by two 
rounds of oligo (dT) cellulose column chromatography (Life Technologies, Rockville, 
MD). cDNA synthesis and library construction were subsequently carried out using the 
UNI-ZAP XR library kit (Stratagene, La Jolla, CA), according to manufacturer's 

20 instructions. 

PCR cloning strategy 

Cytochrome P450 cDNA fragments were amplified from the elicitor-induced 
cDNA library using various combinations of degenerate forward and reverse primers 

25 with the vector-specific T3 and T7 primers. The template DNA was prepared from a 

500 \i\ aliquot of the elicitor-induced cDNA library (3 x 10 6 pfu/fil) by heat denaturation 
at 70°C for 10 minutes, followed by phenol/chloroform extraction, ethanol precipitation 
and re-suspension in 500 ul of sterile, deionized water. Amplification reactions were 
performed in 50 ul volumes containing 50 mM KCI; 10 mM Tris-HCl, pH 8.8; 1 .5 mM 

30 MgCl 2 ; 200 ^iM of each dNTP; 2 \i\ template DNA; 20 pmol each of forward and 
reverse primer; and 1 unit Taq Polymerase (Life Technologies, Rockville, MD). 
Reactions were preheated at 94°C for 2 minutes, followed by thirty-five cycles of 
denaturing at 94°C for 1 minute, annealing at 50°C for 1 minute 30 seconds, and 
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polymerization at 72°C for 2 minutes. The reactions were completed by a 10-minute 
extension at 72°C. Aliquots of the reaction products were examined for DNA products 
by agarose gel fractionation, and ligated directly into the pGEM-T Easy vector 
(Promega, Madison, WI). Resulting recombinant plasmids containing insert DNAs 
5 within the expected size range were sequenced using T7 and Sp6 primers. 

DNA sequencing 

All the DNA sequencing reactions were performed using the BIGDYE™ 
Terminator Cycle sequencing kit (Perkin-Elmer, Wellesley, MA) with the sequences 
10 being read on an automated ABI Prism 310 Genetic Analyzer (Applied Biosystems, 
Foster City, CA). Computer assessment of the DNA sequence information was 
performed using the MACVECTOR (Oxford Molecular, Madison, WI) software 
package. 

15 cDNA library screening 

The cDNA library was screened with digoxigenin labeled probes. A 258 bp 
DNA fragment amplified from the pGEM-deg6.4 clone using gene-specific forward (5- 
GGCGGAGAATTTGTCCTGGAATGTCATTTGGTTTAG-3 , (SEQ ID NO: 13)) and 
reverse (5 '-GT AC AATAGTG AGGTTG AC AATG-3' (SEQ ID NO: 14)) primers; and a 

20 374 bp DNA fragment amplified from the pBKS-CYPB3.843 clone with specific 
forward (5'-GGTGGTTGTGAATGCATG-3* (SEQ ID NO: 15)) and reverse (5 1 - 
TTATGCAGCAATAGGCTTGAAGACA-3 , (SEQ ID NO: 16)) primers, were used to 
screen for CYP71Ds. The probes were labeled with digoxigenin- 1 1-dUTP using the 
PCR DIG Labeling Mix (Roche Molecular Biochemicals, Indianapolis, IN), hybridized 

25 to plaque lifts of the cDNA library plated at approximately 10,000 PFUs per 150 mm 
plate, and was hybridization detected with the DIG detection system according to the 
manufacturer's instructions (Roche Molecular Biochemicals, Indianapolis, IN). Plaques 
exhibiting strong hybridization were plaque purified, auto-subcloned to their plasmid 
forms according to the manufacturer's recommendations (Stratagene, La Jolla, CA), and 

30 then subjected to DNA sequencing as described above. 
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RNA analysis 

RNA gel blot analysis was carried out using 10 ug aliquots of total RNA. RNA 
samples were heat-denatured at 70°C for 15 minutes in sample buffer (lx MOPS, 50% 
formamide, 16% formaldehyde, 30 % glycerol, and 3% ethidium bromide), and size 
5 fractionated on a 1.2% agarose gel containing lx MOPS and 18.1% formaldehyde. 

Uniformity of sample loading was determined by visual inspection of the gel for rRNA 
bands. The RNAs were then transferred to a Zeta Probe nylon membrane (Bio-Rad 
Laboratories, Hercules, CA) and hybridized according to the manufacturer's 
recommendations. Full-length cDNAs probes were labeled with [ 32 P]-dCTP (PRIME-IT 
10 Kit, Stratagene, La Jolla, CA) prior to hybridization. After hybridization, the 

membranes were washed in 2x SSC/0.1% SDS once at room temperature followed by 
sequential washes in 0.2x SSC/0.1% SDS at 42°C and 65°C. Hybridization was 
detected with a Phosphoimager (Molecular Dynamics, model 445 SI). 

15 Construction of yeast expression vectors. 

The coding regions of the P450 cDNAs were cloned into the pYeDP60 
expression vector (Urban et al., J. Biol. Chem. 272:19176-19186, 1990; Pompon et al., 
Methods Enzymol. 272:51-64, 1996). Appropriate BamHI, EcoRI, and SstI restriction 
sites (underlined) were introduced via PCR primers containing these sequences either 

20 upstream of the translation start site (ATG) or downstream of the stop codon (TAA or 
TGA). The primers used to amplify the CYP71D20 cDNA were 5- 
GGG GGATCC ATGCAATTCTTCAGCTTGGTTTCC-3' (SEQ ID NO: 17) and 5'- 
GGG GAATTC TTACTCTCGAGAAGGTTGATAAGG-3' (SEQ ED NO: 18); for the 
CYP82E1 cDNA 5'- CCC GGATCC ATGTATCATCTTCTTTCTCCC-3 t (SEQ ID NO: 

25 1 9) and 5'- GGGGAATTCTCAATATTGATAAAGCGTAGGAGG-3' (SEQ ID NO: 
20) ; and for the CYP92A3 cDNA 5'- 

CCC GGATCC ATGCAATCCTTCAGCTTGGTTTCC-3 t (SEQ ID NO: 21) and 5'- 
GGG GAGCTC TCACTCGCAAGAAGATTGATAAGG-3 f (SEQ ID NO: 22). Two 
long, overlapping (italicized) primers 5'- 
30 GCCA TTA TCGGCGCAA TA CTA ATCTCC A A ACTCCGCGGTA AAAA ATTC AAGCT 
CCCACCTGGTCCAACAGCAGTC-3' (SEQ ID NO: 23) and 5'- 

GGG GGATCC ATGGACCTCCTCCTCATAGAAAAAACCCTCGTCGCCTTATTCG 
CCGCCATTATCGGCGCAATACTA-y (SEQ ID NO: 24) coding for the N-terminal 
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sequence of CYP73A1 (GenEMBL Z17369) up to the hinge region were used for the 
modification of the membranfe anchoring segment of CYP73A27 to avoid possible 
problems with intracellular targeting due to the unusual N-terminus (Nedelkina et al., 
1999); the reverse primer used for both amplifications was 5'- 
5 GGG GAGCTC TTATGCAGCAATAGGCTTGAAGAC-3' (SEQ ID NO: 25). 

CYP71D20 and CYP73A27 were amplified using full-length cDNA templates, whereas 
CYP82E1 and CYP92A5 were amplified directly from the cDNA library template. 
Amplifications were performed in 50 u.1 reactions containing lx Pfx amplification 
buffer; 1 mM MgS0 4 ; 300 uM of each dNTP; 10 ng template DNA; 20 pmol each of 

10 forward and reverse primer; and 1.25 units PLATINUM® Pfx Polymerase (Life 
Technologies, Rockville, MD). Reactions were preheated at 94°C for 2 minutes, 
followed by thirty-five cycles of denaturing at 94°C for 15 seconds, annealing at 55°C 
for 30 seconds, and elongating at 68° for 1 .5 minutes. PCR products were ligated into 
the pGEM-T EASY vector (Promega, Madison, WI) and subcloned into the pYeDP60 

1 5 vector. The resulting constructs were validated by a combination of PCR and DNA 
sequencing. 

Yeast expression studies 

Verified pYeDP60- P450 cDNA constructs were introduced into the yeast 

20 WAT1 1 line, a derivative of the W303-1B strain (MAT a; ade 2-1; his 3-11; leu 2-3, - 
1 12; ura 3-1; can R ; cyr + ), provided by Dr. P. Urban (Centre de Genetique Moleculaire, 
CNRS, Gif-sur-Yvette, France). The endogenous NADPH-cytochrome P450 reductase 
(CPR1) locus has been replaced with ATR1, a NADPH-cytochrome P450 reductase 
from Arabidops is thaliana (Pompon et al., Methods Enzymol. 272:51-64, 1996; Urban 

25 etal.,J. Biol. Chem. 272:19176-19186, 1997), in the WAT11 line. Yeast was grown 
overnight in a 30°C shaker in YPAD (lg/1 yeast extract; 1 g/1 peptone; 20 g/1 glucose; 
200 mg/1 adenine) liquid media. Cultures were harvested at an Ar,oo between 0.5 and 1.5. 
Cells were collected by centrifugation at 2,500 x g for 5 minutes at 4°C, and resuspended 
in ice-cold, sterile dH20. Cells were pelleted again as above and resuspended in 1 M 

30 sorbitol. Forty u.1 of yeast suspension was mixed with 0.5 to 1 ug plasmid DNA (in <5 
u.1 dH 2 0) in a pre-chilled 0.5 ml tube, and transferred to a chilled cuvette with a 0.2 cm 
electrode gap. One pulse at 1 .5 kV, 25 uF, and 200 Ohms was applied by an Eppendorf 
Electroporator (model 2510). A mixture of 500 ul of YPAD/1 M sorbitol was 
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immediately added to the electroporated cells. Cells were allowed to recover at 30°C for 
1 hour, then spread onto SGI plates (1 g/1 bactocasamino acids; 7 g/1 yeast nitrogen base; 
20 g/1 glucose; 20 mg/1 tryptophan; and 20 g/1 agar). Transformed colonies appeared 
after 3 to 6 days of incubation at 30°C. Recombinant plasmids were confirmed by PCR 
5 assays performed directly on randomly selected yeast colonies. 

For expression studies, one colony was added to SGI media (1 g/1 bactocasamino 
acids; 7 g/1 yeast nitrogen base; 20 g/1 glucose; and 20 mg/1 tryptophan) and grown at 
30°C for approximately 24 hours. An aliquot of this culture was diluted 1 :50 into 250 
ml of YPGE (10 g/1 bactopeptone; 10 g/1 yeast extract; 5 g/1 glucose; and 3% ethanol by 

10 volume) and the cells were grown until all glucose was consumed. The absence of 
glucose was determined by placing a 200 |il aliquot of culture into a 1 .5 ml tube, 
inserting a DIASTIX urinalysis reagent strip (Bayer, Elkhart, IN) for 30 seconds, and 
observing colorimetric changes indicating glucose levels. Induction was initiated by the 
addition of 5 grams of galactose (final concentration of 2%). The cultures were 

1 5 maintained at 30°C for an additional 1 6 hours before collecting the cells by 

centrifugation at 7,000 x g for 10 minutes. The pelleted cells were washed with 100 ml 
of TES buffer (50 mM Tris-HCl pH, 7.5; 1 mM EDTA; 0.6 M sorbitol). The cells were 
centrifuged as above, resuspended in 100 ml of TES-M (TES supplemented with 10 mM 
2-mercaptoethanol), and allowed to incubate at room temperature for 10 minutes. The 

20 yeast cells were centrifuged again at 7,000 x g for 10 minutes, and the pellet was 

resuspended in 2.5 ml extraction buffer (1% bovine serum albumin, fraction V; 2 mM 2- 
mercaptoethanol; 1 mM phenylmethylsulfonyl fluoride, all dissolved in TES). Glass 
beads (0.5 mm in diameter, Biospec Products, Inc., Bartlesville, OK) were added until 
skimming the surface of the cell suspension. Cell walls were disrupted manually by hand 

25 shaking in a cold room for 10 min at 30 second intervals separated by 30 second 
intervals on ice. Cell extracts were transferred to a 50 ml centrifuge tube, the glass 
beads were washed three times with 5 ml of extraction buffer, and the washes were 
pooled with the original cell extracts. Microsomes were prepared by differential 
centrifugation at 10,000 g for 10 minutes at 4°C to remove cellular debris, followed by 

30 centrifugation at 100,000 x g for 70 minnutes at 4°C, and microsomal pellets were 
resuspended in 1.5 ml TEG-M buffer (50 mM Tris-HCl, pH 7.5; 1 mM EDTA; 20% 
glycerol; and 1.5 mM 2-mercaptoethanol) and stored frozen at -80°C until further 
assayed. 
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CO difference spectra 

Fe 2+ • CO vs. Fe 2+ difference spectroscopy (Omura and Sato, J. Biol. Chem. 
239:2370-2378, 1964) was performed using 0.4 ml of microsomes suspended in L6 ml 
5 of 50 mM Tris-HCl, pH 7.5; 1 mM EDTA; and 20% glycerol. A small amount of the 
reducing agent, sodium dithionite, was added, and the mixture was distributed between 
two cuvettes. A baseline was recorded between 400 and 500 nm on a Perkin Elmer 
Lambda 18 UV/visible spectrophotometer. CO was then bubbled into the sample 
cuvette for 1 minute, and the difference spectrum recorded again. The amount of 
10 functional P450 was estimated based on an absorbance coefficient of 9 1 mM" 1 • cm" 1 . 

5-epi-aristolochene-l, 3-hydroxylase assays 

5-epi-aristolochene-l, 3-hydroxylase assays were performed in 0.5 ml 
polyethylene tubes in 100 jil volumes. 5-epi-aristolochene or 1 -deoxycapsidiol 

15 dissolved in hexane was added to the tube, and the organic solvent was removed by 
incubation of the open tube at 30°C. 5-epi-aristolochene and 1 -deoxycapsidiol were 
resuspended in 2 jil dimethyl sulfoxide before adding the reaction mixture. Reactions 
were carried out in 100 mM Tris-HCl, pH 7.5, to which microsomal protein was added 
to a final concentration of 1 mg/ml. Reactions were initiated by the addition of 2 mM 

20 NADPH. The final concentration of 5-epi-aristolochene and 1 -deoxycapsidiol in these 
assays varied from 20 to 50 jiM. After incubations for variable lengths of time at 30°C, 
the reactions were extracted with two volumes of ethyl acetate. The organic extracts 
were concentrated and evaluated by GC and GC-MS along with standards of 5-epi- 
aristolochene (Whitehead et al., Phytochemistry 28:775-779, 1989; Rising et al., J. Am. 

25 Chem. Soc. 122:1861-1866, 2000), 1 -deoxycapsidiol (Whitehead et al., Phytochemistry 
29:479-182, 1990), and capsidiol (Whitehead et al., Phytochemistry 26:1367-1369, 
1987; Milatetal., Phytochemistry 30:2171-2173, 1991). GC analysis was routinely 
performed with an HP5890 GC equipped with a Hewlett-Packard HP-5 capillary column 
(30 m x 0.25 mm, 0.25 Jim phase thickness) and FID as described previously (Rising et 

30 al., J. Am. Chem. Soc. 122:1861-1866, 2000). GC-MS analysis was performed at the 
University of Kentucky Mass Spectrometry Facility using a Varian 3400 gas 
chromatograph and a Finnigan INCOS 50 quadrupole mass selective detector. The GC 
was equipped with a J&W DB-5ms capillary column (15 m x 0.25 mm, 0.25 jim phase 
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thickness) and run with He as the carrier gas (10 psi.). Splitless injections were done at 
an injection port temperature of 280°C. The column temperature was maintained at 
40°C for 1 minute and then increased to 280°C at 10°C per minute. Following 
separation by the GC column, samples were introduced directly into the electron impact 
5 ionization source. Mass spectra were acquired at 70 eV, scanning from 40-440 Da in 1 
second. 

Production of cytochrome P450s 

Using the standard molecular techniques described herein, the isolation of 

10 additional cytochrome P450 coding sequences is readily accomplished. For example, 
using all or a portion of the amino acid sequence of any of the disclosed P450s, one may 
readily design P450-specific oligonucleotide probes, including P450 degenerate 
oligonucleotide probes (i.e., a mixture of all possible coding sequences for a given 
amino acid sequence). These oligonucleotides may be based upon the sequence of either 

15 DNA strand and any appropriate portion of the P450 nucleotide sequence. General 

methods for designing and preparing such probes are provided, for example, in Ausubel 
et al., 2000, Current Protocols in Molecular Biology, Wiley Interscience, New York, and 
Berger and Kimmel, Guide to Molecular Cloning Techniques, 1987, Academic Press, 
New York. These oligonucleotides are useful for P450 gene isolation, either through 

20 their use as probes capable of hybridizing to a P450 complementary sequence, or as 
primers for various amplification techniques, for example, polymerase chain reaction 
(PCR) cloning strategies. 

Hybridization techniques and screening procedures are well known to those 
skilled in the art and are described, for example, in Ausubel et al. (supra); Berger and 

25 Kimmel (supra); Chen et al., Arch. Biochem. Biophys. 324:255, 1995; and Sambrook et 
al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
New York. If desired, a combination of different oligonucleotide probes may be used 
for the screening of a recombinant DNA library. The oligonucleotides may be 
detectably-labeled using methods known in the art and used to probe filter replicas from 

30 a recombinant DNA library. Recombinant DNA libraries are prepared according to 
methods well known in the art, for example, as described in Ausubel et al. (supra), or 
they may be obtained from commercial sources. 
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As discussed above, P450 oligonucleotides may also be used as primers in a 
polymerase chain reaction (PCR) amplification cloning strategy. PCR methods are well 
known in the art and are described, for example, in PCR Technology, Erlich, ed., 
Stockton Press, London, 1989; PCR Protocols: A Guide to Methods and Applications, 
5 Innis et al., eds., Academic Press, Inc., New York, 1 990; and Ausubel et al. (supra). 

Primers are optionally designed to allow cloning of the amplified product into a suitable 
vector, for example, by including appropriate restriction sites at the 5* and 3' ends of the 
amplified fragment (as described herein). If desired, a P450 gene may be isolated using 
the PCR "RACE" technique, or Rapid Amplification of cDNA Ends (see, e.g., Innis et 

10 al. (supra)). By this method, oligonucleotide primers based on a P450 sequence are 

oriented in the 3' and 5* directions and are used to generate overlapping PCR fragments. 
These overlapping 3'- and 5*-end RACE products are combined to produce an intact full- 
length cDNA. This method is described in Innis et al. (supra); and Frohman et al., Proc. 
Natl. Acad. Sci. USA 85:8998, (1988). 

15 Additional methods for identifying sequences encoding P450s are provided in 

Maughan etal. (Arch. Biochem. Biophys. 341:104-1 11, 1997) and Clark etal. (Plant 
Mol. Biol. 33:875-885, 1997). 

Useful P450 sequences may be isolated from any appropriate organism. 
Confirmation of a sequence's relatedness to a P450 polypeptide disclosed herein may be 

20 accomplished by a variety of conventional methods, for example, by comparing the 

sequence with a known p450 sequence found in a database. In addition, the activity of 
any P450 may be evaluated according to any of the techniques described herein. 

P450 polypeptide expression 

25 P450 polypeptides may be produced by transformation of a suitable host cell 

with all or part of a P450 DNA (for example, any one of the P450 cDNAs described 
herein) in a suitable expression vehicle or with a plasmid construct engineered for 
increasing the expression of a P450 polypeptide in vivo. 

Those skilled in the field of molecular biology will appreciate that any of a wide 

30 variety of expression systems may be used to provide the recombinant protein. The 

precise host cell used is not critical to the invention. The P450 protein may be produced 
in a prokaryotic host, for example, £. coli TBI, or in a eukaryotic host, for example, 
Saccharomyces cerevisiae, insect cells, mammalian cells (for example, COS 1 or NIH 
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3T3 cells), or any of a number of plant cells including, without limitation, algae, tree 
species, ornamental species, temperate fruit species, tropical fruit species, vegetable 
species, legume species, monocots, dicots, or in any plant of commercial or agricultural 
significance. Particular examples of suitable plant hosts include, but are not limited to, 
5 Conifers, Petunia, Tomato, Potato, Tobacco, Grape, Arabidopsis, Lettuce, Sunflower, 
Oilseed rape, Flax, Cotton, Sugarbeet, Celery, Soybean, Alfalfa, Medicago, Lotus, 
Vigna, Cucumber, Carrot, Eggplant, Cauliflower, Horseradish, Morning Glory, Poplar, 
Walnut, Apple, Asparagus, Grape, Rice, Maize, Millet, Onion, Barley, Orchard grass, 
Oat, Rye, Tobacco, and Wheat. 

10 Such cells are available from a wide range of sources including: the American 

Type Culture Collection (Rockland, Md.); or from any of a number seed companies, for 
example, W. Atlee Burpee Seed Co. (Warminster, Pa.), Park Seed Co. (Greenwood, 
S.C.), Johnny Seed Co. (Albion, Me.), or Northrup King Seeds (Harstville, S.C.). 
Descriptions and sources of useful host cells are also found in Vasil I. K., Cell Culture 

15 and Somatic Cell Genetics of Plants, Vol I, II, III Laboratory Procedures and Their 
Applications Academic Press, New York, 1984; Dixon, R. A., Plant Cell Culture— A 
Practical Approach, IRL Press, Oxford University, 1985; Green et al., Plant Tissue and 
Cell Culture, Academic Press, New York, 1987; and Gasser and Fraley, Science 
244:1293, (1989). 

20 For prokaryotic expression, DNA encoding a P450 polypeptide is carried on a 

vector operably linked to control signals capable of effecting expression in the 
prokaryotic host. If desired, the coding sequence may contain, at its 5* end, a sequence 
encoding any of the known signal sequences capable of effecting secretion of the 
expressed protein into the periplasmic space of the host cell, thereby facilitating 

25 recovery of the protein and subsequent purification. Prokaryotes most frequently used 
are various strains of E. coli; however, other microbial strains may also be used. 
Plasmid vectors are used which contain replication origins, selectable markers, and 
control sequences derived from a species compatible with the microbial host. Examples 
of such vectors are found in Pouwels et al. (supra) or Ausubel et al. (supra). Commonly 

30 used prokaryotic control sequences (also referred to as "regulatory elements") are 
defined herein to include promoters for transcription initiation, optionally with an 
operator, along with ribosome binding site sequences. Promoters commonly used to 
direct protein expression include the beta- lactamase (penicillinase), the lactose (lac), the 
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tryptophan (Trp) (Goeddel et al., Nucl. Acids Res. 8:4057 (1980)), and the tac promoter 
systems, as well as the lambda-derived P.sub.L promoter and N-gene ribosome binding 
site (Simatake et al, Nature 292:128 (1981)). 

One particular bacterial expression system for P450 production is the E. coli pET 
5 expression system (Novagen). According to this expression system, DNA encoding a 
P450 is inserted into a pET vector in an orientation designed to allow expression. Since 
the P450 gene is under the control of the T7 regulatory signals, P450 expression is 
dependent on inducing the expression of T7 RNA polymerase in the host cell. This is 
typically achieved using host strains which express T7 RNA polymerase in response to 

10 IPTG induction. Once produced, recombinant P450 is then isolated according to 
standard methods known in the art, for example, those described herein. 

Another bacterial expression system for P450 production is the pGEX expression 
system (Pharmacia). This system employs a GST gene fusion system that is designed 
for high-level expression of a gene or gene fragment as a fusion protein with rapid 

15 purification and recovery of the functional gene product The P450 of interest is fused 
to the carboxyl terminus of the glutathione S-transferase protein from Schistosoma 
japonicum and is readily purified from bacterial ly sates by affinity chromatography 
using Glutathione Sepharose 4B. Fusion proteins can be recovered under mild 
conditions by elution with glutathione. Cleavage of the glutathione S-transferase 

20 domain from the fusion protein is facilitated by the presence of recognition sites for site- 
specific proteases upstream of this domain. For example, proteins expressed in pGEX- 
2T plasmids may be cleaved with thrombin; those expressed in pGEX-3X may be 
cleaved with factor Xa. 

Other prokaryotic systems useful for expressing eukaryotic P450s are described 

25 by Cooper (Mutat. Res. 454:45-52, 2000) and Dong et al. (Arch. Biochem. Biophys. 

327:254-259, 1996). In addition, strategies for enhancing the prokaryotic expression of 
a cytochrome P450 in combination with cytochrome reductase are described in Porter et 
al. (Drug. Metab. Rev. 31:159-174, 1999). 

For eukaryotic expression, the method of transformation or transfection and the 

30 choice of vehicle for expression of the P450 will depend on the host system selected. 
Transformation and transfection methods of numerous organisms, for example, the 
baker's yeast Saccharomyces cerevisiae, are described, e.g., in Ausubel et al. (supra); 
Weissbach and Weissbach, Methods for Plant Molecular Biology, Academic Press, 



27 



WO 02/072758 



PCT/US02/06912 



1989; Gelvin et al., Plant Molecular Biology Manual, Kluwer Academic Publishers, 
1990; Kindle, K., Proc. Natl. Acad. Sci. U.S.A. 87:1228 (1990); Potrykus, I., Annu. Rev. 
Plant Physiol. Plant Mol. Biology 42:205 (1991); and BioRad (Hercules, Calif.) 
Technical Bulletin #1687 (Biolistic Particle Delivery Systems). Expression vehicles 
5 may be chosen from those provided, e.g., in Cloning Vectors: A Laboratory Manual (P. 
H. Pouwels et al., 1985, Supp. 1987); Gasser and Fraley (supra); Clontech Molecular 
Biology Catalog (Catalog 1992/93 Tools for the Molecular Biologist, Palo Alto, Calif.); 
and the references cited above. 

One preferred eukaryotic expression system is the mouse 3T3 fibroblast host cell 

10 transfected with a pMAMneo expression vector (Clontech). pMAMneo provides: an 
RSV-LTR enhancer linked to a dexamethasone-inducible MMTV-LTR promoter, an 
SV40 origin of replication which allows replication in mammalian systems, a selectable 
neomycin gene, and SV40 splicing and polyadenylation sites. DNA encoding a P450 is 
inserted into the pMAMneo vector in an orientation designed to allow expression. The 

1 5 recombinant P450 is then isolated as described below. Other preferable host cells which 
may be used in conjunction with the pMAMneo expression vehicle include COS cells 
and CHO cells (ATCC Accession Nos. CRL 1650 and CCL 61, respectively). 

Alternatively, if desired, a P450 is produced by a stably-transfected mammalian 
cell line. A number of vectors suitable for stable transfection of mammalian cells are 

20 available to the public, e.g., see Pouwels et al. (supra); methods for constructing such 
cell lines are also publicly available, e.g., in Ausubel et al. (supra). In one example, 
cDNA encoding the P450 is cloned into an expression vector which includes the 
dihydrofolate reductase (DHFR) gene. Integration of the plasmid and, therefore, the 
P450-encoding gene into the host cell chromosome is selected for by inclusion of 0.01- 

25 300 uM methotrexate in the cell culture medium (as described in Ausubel et al., supra). 
This dominant selection can be accomplished in most cell types. Recombinant protein 
expression can be increased by DHFR-mediated amplification of the transfected gene. 
Methods for selecting cell lines bearing gene amplifications are described in Ausubel et 
al. (supra); such methods generally involve extended culture in medium containing 

30 gradually increasing levels of methotrexate. DHFR-containing expression vectors 

commonly used for this purpose include pCVSEII-DHrF and pAdD26SV(A) (described 
in Ausubel et al., supra). Any of the host cells described above or, preferably, a DHFR- 
deficient CHO cell line (for example, CHO DHFR cells, ATCC Accession Number CRL 
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9096) are among the host cells preferred for DHFR selection of a stably-transfected cell 
line or DHFR-mediated gene amplification. 

A cytochrome P450 may also be produced in insect cells, such cells include, 
without limitation, Spodoptera frugiperda (Sf)-9, Sf-21, or Drosophila melanogaster 
5 Schneider (SL-2) cells. For P450 production, insect cells are typically infected with a 
baculovirus, for example, Autographa californica Multiple Nuclear Polyhedrosis Virus 
(AcMNPV) containing an expression cassette for such a protein, e.g., cytochrome P450, 
at a multiplicity of infection of 1 to 10. The infected cells are generally cultured in a 
standard insect cell culture medium for 24 to 48 hours prior to recovering the protein 

1 0 using standard molecular biology techniques. If desired, a P450 polypeptide may also 
be produced in insect cells directly transfected with a DNA construct containing an 
expression cassette encoding the P450. 

Furthermore, any of the cytochrome P450s described herein may be produced in 
yeast, for example, Pichia pastoris. In order to produce the P450, yeast cells are 

1 5 transformed with an expression cassette containing, for example, a promoter such as the 
AOX1 or phosphoglycerate kinase gene promoter, the P450 gene to be expressed, and a 
terminator. Such an expression cassette may contain an origin of replication or it may 
be integrated into the yeast genomic DNA. The expression cassette is generally 
introduced by lithium acetate transformation or by the use of spheroplasts. In order to 

20 select for successfully transformed cells, the yeast are plated, for example, on minimal 
media which only allows yeast carrying the introduced expression cassette to grow. 

In addition, expression of recombinant proteins in yeast using a Hansenula 
polymorpha expression system is described in U.S. Patent Nos. 5,741,674 and 
5,672,487. 

25 A P450 may also be produced by a stably-transfected plant cell line or by a 

transgenic plant. Such genetically-engineered plants are useful for a variety of industrial 
and agricultural applications as discussed below. Importantly, this invention is 
applicable to gymnosperms and angiosperms, and will be readily applicable to any new 
or improved transformation or regeneration method. 

30 A number of vectors suitable for stable transfection of plant cells or for the 

establishment of transgenic plants are available to the public; such vectors are described 
in Pouwels et al. (supra), Weissbach and Weissbach (supra), and Gelvin et al. (supra). 
Methods for constructing such cell lines are described in, e.g., Weissbach and 
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Weissbach (supra), and Gelvin et al. (supra). Typically, plant expression vectors include 
(1) a cloned P450 gene under the transcriptional control of 5' and 3* regulatory sequences 
and (2) a dominant selectable marker. Such plant expression vectors may also contain, 
if desired, a promoter regulatory region (for example, one conferring inducible or 
5 constitutive expression, or environmentally- or developmentally-regulated, or pathogen- 
or wound-inducible, or cell- or tissue-specific expression), a transcription initiation start 
site, a ribosome binding site, an RNA processing signal, a transcription termination site, 
and/or a polyadenylation signal. 

The P450 DNA sequence of the invention may, if desired, be combined with other 

10 DNA sequences in a variety of ways. The P450 DNA sequence of the invention may be 
employed with all or part of the gene sequences normally associated with a P450. In its 
component parts, a DNA sequence encoding a P450 is combined in a DNA construct 
having a transcription initiation control region capable of promoting transcription and 
translation in a host cell. 

15 In general, the constructs will involve regulatory regions functional in plants 

which provide for production of a P450 as discussed herein. The open reading frame 
coding for the P450, or a functional fragment thereof, will be joined at its 5' end to a 
transcription initiation regulatory region such as the sequence naturally found in the 5' 
upstream region of a P450 structural gene, for example, a CYP71D20 (SEQ ID NO:2) or 

20 CYP71D21 (SEQ ID NO:4) gene. Numerous other transcription initiation regions are 
available which provide for constitutive or inducible regulation. 

For applications when developmental, cell, tissue, hormonal, environmental, or 
pathogen-inducible expression are desired, appropriate 5' upstream non-coding regions 
are obtained from other genes; for example, from genes regulated during seed 

25 development, embryo development, leaf development, or in response to a pathogen. 

Regulatory transcript termination regions may also be provided in DNA constructs 
of this invention as well. Transcript termination regions may be provided by the DNA 
sequence encoding a P450 or any convenient transcription termination region derived 
from a different gene source. The transcript termination region will contain preferably 

30 at least 1-3 kb of sequence 3' to the structural gene from which the termination region is 
derived. 
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An example of a useful plant promoter according to the invention is a 
caulimovirus promoter, such as, a cauliflower mosaic virus (CaMV) promoter. These 
promoters confer high levels of expression in most plant tissues, and the activity of these 
promoters is not dependent on virally encoded proteins. CaMV is a source for both the 
5 35S and 19S promoters. In most tissues of transgenic plants, the CaMV 35S promoter is 
a strong promoter (see, e.g., Odell et al., Nature 313:810 (1985)). The CaMV promoter 
is also highly active in monocots (see, e.g., Dekeyser et al., Plant Cell 2:591 (1990); 
Terada and Shimamoto, Mol. Gen. Genet. 220:389, (1990)). Moreover, activity of this 
promoter can be further increased (i.e., between 2-10 fold) by duplication of the CaMV 
10 35S promoter (see e.g., Kay et al., Science 236:1299 (1987); Ow et al., Proc. Natl. Acad. 
Sci. U.S.A. 84:4870 (1987); and Fang etal., Plant Cell 1:141 (1989)). Other useful plant 
promoters include, without limitation, the nopaline synthase promoter (An et al., Plant 
Physiol. 88:547 (1988)) and the octopine synthase promoter (Fromm et al., Plant Cell 
1:977(1989)). 

1 5 For certain applications, it may be desirable to produce the P450 gene product in 

an appropriate tissue, at an appropriate level, or at an appropriate developmental time. 
For this purpose, there is an assortment of gene promoters, each with its own distinct 
characteristics embodied in its regulatory sequences, which have been shown to be 
regulated in response to the environment, hormones, and/or developmental cues. These 

20 include gene promoters that are responsible for heat-regulated gene expression (see, e.g., 
Callis et al., Plant Physiol. 88:965 (1988); Takahashi and Komeda, Mol. Gen. Genet. 
219:365 (1989); and Takahashi etal., Plant J. 2:751(1992)), light-regulated gene 
expression (e.g., the pea rbcS-3A described by Kuhlemeier et al. (Plant Cell 1 :471 
(1989); the maize rbcS promoter described by Schaffher and Sheen, (Plant Cell 3:997 

25 (1 991); or the chlorophyll a/b-binding protein gene found in pea described by Simpson 
et al. (EMBO J. 4:2723 (1985)), hormone-regulated gene expression (for example, the 
abscisic acid (ABA) responsive sequences from the Em gene of wheat described by 
Marcotte et al. (Plant Cell 1 :969 (1989); the ABA-inducible HVA1 and HVA22, and the 
rd29A promoters described for barley and Arabidopsis by Straub et al. (Plant Cell 6:617 

30 (1994), Shen et al. (Plant Cell 7:295 (1994)), and wound-induced gene expression (for 
example, of wunl described by Siebertz et al. (Plant Cell 1 :961 (1989)), or organ- 
specific gene expression (for example, of the tuber-specific storage protein gene 
described by Roshal et al. (EMBO J. 6: 1 1 55 (1 987); the 23-kDa zein gene from maize 
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described by Schernthaner et al. (EMBO J. 7:1249 (1988); or the French bean beta- 
phaseolin gene described by Bustos et al., (Plant Cell 1 :839 (1989)); and pathogen- 
inducible gene expression described by Chappell et al. in U.S. Ser. Nos. 08/471,983, 
08/443,639, and 08/577,483, hereby incorporated by reference. 
5 Plant expression vectors may also optionally include RNA processing signals, for 

example, introns, which have been shown to be important for efficient RNA synthesis 
and accumulation (Callis et al., Genes and Dev. 1 : 1 1 83 (1987)). The location of the 
RNA splice sequences can dramatically influence the level of transgene expression in 
plants. In view of this fact, an intron may be positioned upstream or downstream of a 

10 P450-encoding sequence in the transgene to modulate levels of gene expression. 

In addition to the aforementioned 5' regulatory control sequences, the expression 
vectors may also include regulatory control regions which are generally present in the 3* 
regions of plant genes (Thornburg et al., Proc. Natl. Acad. Sci. U.S.A. 84:744 (1987); 
An et al., Plant Cell 1:115 (1989)). For example, the 3* terminator region may be 

1 5 included in the expression vector to increase stability of the mRNA. One such 

terminator region may be derived from the PI-II terminator region of potato. In addition, 
other commonly used terminators are derived from the octopine or nopaline synthase 
signals. 

The plant expression vector also typically contains a dominant selectable marker 
20 gene used to identify those cells that have become transformed. Useful selectable genes 
for plant systems include genes encoding antibiotic resistance genes, for example, those 
encoding resistance to hygromycin, kanamycin, bleomycin, G418, streptomycin, or 
spectinomycin. Genes required for photosynthesis may also be used as selectable 
markers in photosynthetic-deficient strains. Alternatively, the green-fluorescent protein 
25 from the jellyfish Aequorea victoria may be used as a selectable marker (Sheen et al., 
Plant J. 8:777, 1995; Chiu et al., Current Biology 6:325 (1996)). Finally, genes 
encoding herbicide resistance may be used as selectable markers; useful herbicide 
resistance genes include the bar gene encoding the enzyme phosphinothricin 
acetyltransferase and conferring resistance to the broad-spectrum herbicide BASTA 
30 (Hoechst AG, Frankfurt, Germany). 

Efficient use of selectable markers is facilitated by a determination of the 
susceptibility of a plant cell to a particular selectable agent and a determination of the 
concentration of this agent which effectively kills most, if not all, of the transformed 
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cells. Some useful concentrations of antibiotics for tobacco transformation include, e.g., 
75-100 ug/ml (kanamycin), 20-50 jig/ml (hygromycin), or 5-10 ug/ml (bleomycin). A 
useful strategy for selection of transformants for herbicide resistance is described, e.g., 
by Vasil et al., supra. 

5 It should be readily apparent to one skilled in the art of molecular biology, 

especially in the field of plant molecular biology, that the level of gene expression is 
dependent, not only on the combination of promoters, RNA processing signals, and 
terminator elements, but also on how these elements are used to increase the levels of 
selectable marker gene expression. 

10 

Plant transformation 

Upon construction of the plant expression vector, several standard methods are 
available for introduction of the vector into a plant host, thereby generating a transgenic 
plant. These methods include (1) Agrobacterium-mediated transformation (A. 

15 tumefaciens or A. rhizogenes) (see, e.g., Lichtenstein and Fuller, In: Genetic 

Engineering, vol. 6, PWJ Rigby, ed, London, Academic Press, 1987; and Lichtenstein, 
C. P., and Draper, J., In: DNA Cloning, Vol II, D. M. Glover, ed, Oxford, IRI Press, 
1985)), (2) the particle delivery system (see, e.g., Gordon-Kamm et al., Plant Cell 2:603 
(1990); orBioRad Technical Bulletin 1687, supra), (3) microinjection protocols (see, 

20 e.g., Green et al., supra), (4) polyethylene glycol (PEG) procedures (see, e.g., Draper et 
al., Plant Cell Physiol. 23:451 (1982); or e.g., Zhang and Wu, Theor. Appl. Genet. 
76:835 (1988)), (5) liposome-mediated DNA uptake (see, e.g., Freeman et al., Plant Cell 
Physiol. 25:1353 (1984)), (6) electroporation protocols (see, e.g., Gelvin et al., supra; 
Dekeyser et al., supra; Fromm et al., Nature 319:791 (1986); Sheen, Plant Cell 2:1027 

25 (1990); or Jang and Sheen, Plant Cell 6:1665 (1994)), and (7) the vortexing method (see, 
e.g., Kindle, supra). The method of transformation is not critical to the present 
invention. Any method which provides for efficient transformation may be employed. 
As newer methods are available to transform crops or other host cells, they may be 
directly applied. 

30 The following is an example outlining one particular technique, an 

Agrobacterium-mediated plant transformation. By this technique, the general process 
for manipulating genes to be transferred into the genome of plant cells is carried out in 
two phases. First, cloning and DNA modification steps are carried out in E. coli, and the 
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plasmid containing the gene construct of interest is transferred by conjugation or 
electroporation into Agrobacterium. Second, the resulting Agrobacterium strain is used 
to transform plant cells. Thus, for the generalized plant expression vector, the plasmid 
contains an origin of replication that allows it to replicate in Agrobacterium and a high 
5 copy number origin of replication functional in E. coli. This permits facile production 
and testing of transgenes in E coli prior to transfer to Agrobacterium for subsequent 
introduction into plants. Resistance genes can be carried on the vector, one for selection 
in bacteria, for example, streptomycin, and another that will function in plants, for 
example, a gene encoding kanamycin resistance or herbicide resistance. Also present on 

10 the vector are restriction endonuclease sites for the addition of one or more transgenes 
and directional T-DNA border sequences which, when recognized by the transfer 
functions of Agrobacterium, delimit the DNA region that will be transferred to the plant. 

In another example, plant cells may be transformed by shooting into the cell 
tungsten microprojectiles on which cloned DNA is precipitated. In the Biolistic 

15 Apparatus (Bio-Rad) used for the shooting, a gunpowder charge (22 caliber Power 

Piston Tool Charge) or an air-driven blast drives a plastic macroprojectile through a gun 
barrel. An aliquot of a suspension of tungsten particles on which DNA has been 
precipitated is placed on the front of the plastic macroprojectile. The latter is fired at an 
acrylic stopping plate that has a hole through it that is too small for the macroprojectile 

20 to pass through. As a result, the plastic macroprojectile smashes against the stopping 
plate, and the tungsten microprojectiles continue toward their target through the hole in 
the plate. For the present invention, the target can be any plant cell, tissue, seed, or 
embryo. The DNA introduced into the cell on the microprojectiles becomes integrated 
into either the nucleus or the chloroplast. 

25 In general, transfer and expression of transgenes in plant cells are now routine 

practices to those skilled in the art, and have become major tools to carry out gene 
expression studies in plants and to produce improved plant varieties of agricultural or 
commercial interest. 



30 Transgenic plant regeneration 

Plants cells transformed with plant expression vectors can be regenerated, for 
example, from single cells, callus tissue, or leaf discs according to standard plant tissue 
culture techniques. It is well known in the art that various cells, tissues, and organs from 
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almost any plant can be successfully cultured to regenerate an entire plant; such 
techniques are described, e.g., in Vasil supra; Green et al., supra; Weissbach and 
Weissbach, supra; and Gelvin et al., supra. 

In one particular example, a cloned P450, under the control of the EAS4 promoter 
5 and the nopaline synthase terminator and carrying a selectable marker (for example, 

kanamycin resistance), is transformed into Agrobacterium. Transformation of leaf discs 
(for example, of tobacco leaf discs), with vector-containing Agrobacterium is carried out 
as described by Horsch et al. (Science 227:1229 (1985)). Putative transformants are 
selected after a few weeks (for example, 3 to 5 weeks) on plant tissue culture media 

10 containing kanamycin (e.g., 100 ng/ml). Kanamyc in-resistant shoots are then placed on 
plant tissue culture media without hormones for root initiation. Kanamycin-resistant 
plants are then selected for greenhouse growth. If desired, seeds from self-fertilized 
transgenic plants can then be sowed in soil-less medium and grown in a greenhouse. 
Kanamycin-resistant progeny are selected by sowing surface sterilized seeds on 

1 5 hormone-free kanamycin-containing media. Analysis for the integration of the 

transgene is accomplished by standard techniques (see, for example, Ausubel et al., 
supra; Gelvin et al., supra). 

Transgenic plants expressing the selectable marker are then screened for 
transmission of the transgene DNA by standard immunoblot and DNA detection 

20 techniques. Each positive transgenic plant and its transgenic progeny is unique in 

comparison to other transgenic plants established with the same transgene. Integration 
of the transgene DNA into the plant genomic DNA is in most cases random, and the site 
of integration can profoundly affect the levels and the tissue and developmental patterns 
of transgene expression. Consequently, a number of transgenic lines are usually 

25 screened for each transgene to identify and select plants with the most appropriate 
expression profiles. 

Transgenic lines are generally evaluated for levels of transgene expression. 
Expression at the RNA level is determined initially to identify and quantitate expression- 
positive plants. Standard techniques for RNA analysis are employed and include PCR 

30 amplification assays using oligonucleotide primers designed to amplify only transgene 
RNA templates and solution hybridization assays using transgene-specific probes (see, 
e.g., Ausubel et al., supra). The RNA-positive plants are then analyzed for protein 
expression by Western immunoblot analysis using specific antibodies to the P450 (see, 
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e.g., Ausubel et aL, supra). In addition, in situ hybridization and immunocytochemistry 
according to standard protocols can be done using transgene-specific nucleotide probes 
and antibodies, respectively, to localize sites of expression within transgenic tissue. 

Once the recombinant P450 is expressed in any cell or in a transgenic plant (for 
5 example, as described above), it may be isolated, e.g., using affinity chromatography. In 
one example, an anti-P450 antibody (e.g., produced as described in Ausubel et aL, supra, 
or by any standard technique) may be attached to a column and used to isolate the 
polypeptide. Lysis and fractionation of P450-producing cells prior to affinity 
chromatography may be performed by standard methods (see, e.g., Ausubel et al., 
10 supra). Once isolated, the recombinant protein can, if desired, be further purified, for 
example, by high performance liquid chromatography (see, e.g., Fisher, Laboratory 
Techniques in Biochemistry and Molecular Biology, eds., Work and Burdon, Elsevier, 
1980). 

These general techniques of polypeptide expression and purification can also be 
15 used to produce and isolate useful P450 fragments or analogs. 



Use 

The aforementioned cytochrome P450 polypeptides of the invention are useful in 
the biosynthesis of hormones, lipids, and secondary metabolites, and may also help 
20 plants tolerate potentially harmful exogenous chemicals such as herbicides, pesticides, 
and pollutants. In addition, such cytochrome P450 polypeptides are useful in the 
chemical defense of plants against insects, as well as against bacterial, viral, and fungal 
infection. 



25 Engineering plant disease resistance 

Plasmid constructs designed for the expression of a P450 gene product are 
useful, for example, for activating plant defense pathways that confer anti-pathogenic 
properties to a transgenic plant, for example, the production of phytoalexins. P450 
genes that are isolated from a host plant (e.g., Nicotiana) may be engineered for 

30 expression in the same plant, a closely related species, or a distantly related plant 
species. For example, a P450 gene may be engineered for constitutive low-level 
expression and then transformed into a Nicotiana host plant. Alternatively, the P450 
gene may be engineered for expression in other solanaceous plants, including, but not 
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limited to, potato and tomato. To achieve pathogen resistance, it is important to express 
a P450 protein at an effective level. Evaluation of the level of pathogen protection 
conferred to a plant by ectopic expression of the P450 gene is determined according to 
conventional methods and assays. 

5 

Industrial applications 

The invention also includes engineering host cells to include novel isoprenoid 
metabolic pathways useful in the production of new isoprenoid compounds. By 
introducing genes encoding an isoprenoid synthase (as disclosed in U.S. Patent No. 

10 5,824,774 and WO 00/1 7327) and a cytochrome P450, an acetyltransferase, a methyl 

transferase, a fatty acyltransferase, or a combination thereof, various isoprenoid reaction 
products may be modified, controlled, or manipulated, resulting in enhancement of 
production of numerous isoprenoid reaction products, for example, the production of 
novel monoterpenes, diterpenes, and sesquiterpenes. Such compounds are useful as 

15 phytoalexins, insecticides, perfumes, and pharmaceuticals such as anti-bacterial and 
fungal agents. 

In one working example, an isoprenoid synthase or a chimeric isoprenoid 
synthase (as disclosed in U.S. Patent 5,824,774 and WO 00/17327) and a P450 gene are 
introduced into yeast, for example, using any of the procedures described herein. If 

20 desired, such cells may also express, either independently or in combination, an 

acetyltransferase (see, for example, Walker et aL, Proc. Natl. Acad. Sci. USA 18:583- 
587, 2000), a methylase transferase (see, for example, Diener et al., Plant Cell 12:853- 
870, 2000) gene, or a fatty acyltransferase gene, as well as a cytochrome reductase. 
Cells are then cultured under standard conditions and the production of isoprenoid 

25 compounds is assayed according to methods known in the art. Isoprenoid compounds 
are further purified according to methods well known in the art. Cells expressing novel 
isoprenoid compounds are taken as useful in the invention. 

Such methods provide a unique approach for producing novel isoprenoid starting 
materials and end products. Either prokaryotic or eukaryotic cells transformed with any 

30 of the aforementioned enzymes (or combinations thereof) may be used. Moreover, 
isoprenoid compounds may be produced in any number of ways known in the art 
including an in vitro combination of purified enzymes with an appropriate substrate or 
direct fermentation using a host cell which expresses any combination of the 
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aforementioned enzymes and the appropriate substrates sufficient to drive production of 
isoprenoid compounds. 

The invention is also useful for the production of insect attractants and 
deterrents, which may either deter insect pests or attract insect predators. In addition, 
5 the invention is also useful for generating novel flavorings and perfumes. 

Other embodiments 

From the foregoing description, one skilled in the art can easily ascertain the 
essential characteristics of this invention, and can make various changes and 
10 modifications of the invention to adapt it to various usages and conditions. Thus, other 
embodiments are also within the claims. 

All publications and patents mentioned in this specification are herein 
incorporated by reference to the same extent as if each individual publication or patent 
was specifically and individually indicated to be incorporated by reference. 

15 

We claim: 
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CLAIMS 

1 . A method for producing an isoprenoid compound, said method comprising 
the steps of: 

5 (a) culturing a host cell under conditions suitable for expressing a first 

recombinant protein comprising an isoprenoid synthase and a second recombinant 
protein comprising a cytochrome P450 polypeptide, wherein said first and second 
recombinant protein catalyze the formation of an isoprenoid compound not normally 
produced by said host cell; and 
10 (b) recovering said isoprenoid compound. 

2. The method of claim 1 , wherein said cytochrome P450 polypeptide is selected 
from the group consisting of a polypeptide comprising the amino acid sequence of SEQ 
ID NO:l, a polypeptide comprising the amino acid sequence of SEQ ID NO:3, a 
polypeptide comprising the amino acid sequence of SEQ ID NO:5, and a polypeptide 
comprising the amino acid sequence of SEQ ID NO:7. 

3. The method of claim 1, wherein said host cell further comprises a third 
recombinant protein selected from the group consisting of an acetyltransferase, a 
methyltransferase, and a fatty acyltransferase. 

4. The method of claim 1 , wherein said host cell expresses an endogenous or 
recombinant cytochrome reductase. 

25 5. The method of claim 1 , wherein said host cell is a yeast cell, a bacterial cell, 

an insect cell, or a plant cell. 

6. A host cell expressing a recombinant isoprenoid synthase and a recombinant 
cytochrome P450 polypeptide. 

30 

7. The host cell of claim 6, wherein said host cell further expresses a 
recombinant acetyltransferase, a recombinant methyltransferase, or a recombinant fatty 
acyltransferase. 
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8. The host cell of claim 6, wherein said host cell expresses an endogenous or 
recombinant cytochrome reductase. 

5 9. The host cell of claim 6, wherein said host cell is selected from the group 

consisting of a yeast cell, a bacterial cell, an insect cell, and a plant cell. 

10. An isolated cytochrome P450 polypeptide having 80% identity to the amino 
acid sequence of SEQ ID NO: 1 or SEQ ID NO:3. 

10 

11. The polypeptide of claim 10, said polypeptide comprising the amino acid 
sequence of SEQ ID NO:l. 

12. The polypeptide of claim 10, said polypeptide comprising the amino acid 
1 5 sequence of SEQ ID NO: 3 . 

13. An isolated cytochrome P450 polypeptide having 85% identity to the amino 
acid sequence of SEQ ID NO:5 or SEQ ID NO:7. 

20 14. The polypeptide of claim 13, said polypeptide comprising the amino acid 

sequence of SEQ ID NO:5. 

15. The polypeptide of claim 13, said polypeptide comprising the amino acid 
sequence of SEQ ID NO: 7. 

25 

16. An isolated cytochrome P450 polypeptide having 97% identity to the amino 
acid sequence of SEQ ID NO: 1 1 . 

17. The polypeptide of claim 16, said polypeptide comprising the amino acid 
30 sequence of SEQ ID NO: 1 1 . 
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18. An isolated cytochrome P450 nucleic acid molecule which encodes a 
polypeptide having 80% identity to the amino acid sequence of SEQ ID NO:l, SEQ ID 
NO:3, orSEQIDNO:ll. 

5 1 9. The isolated nucleic acid molecule of claim 1 8, wherein said nucleic acid 

molecule comprises SEQ ID NO:l. 

20. The isolated nucleic acid molecule of claim 18, wherein said nucleic acid 
molecule 

10 comprises SEQ ID NO:3. 

21. The isolated nucleic acid molecule of claim 18, wherein said nucleic acid 
molecule comprises SEQ ID NO:l 1 . 

15 22. An isolated cytochrome P450 nucleic acid molecule which encodes a 

polypeptide having 85% identity to the amino acid sequence of SEQ ID NO:5 or SEQ 
IDNO:7. 

23. The isolated nucleic acid molecule of claim 22, wherein said nucleic acid 
20 molecule comprises SEQ ID NO:5. 

24. The isolated nucleic acid molecule of claim 22, wherein said nucleic acid 
molecule comprises SEQ ID NO:7. 

25 25. An isolated nucleic acid molecule that specifically hybridizes under highly 

stringent conditions to any one of the complement of the sequence described in SEQ ID 
NOS:2, 4, 6, 8, or 12, wherein said nucleic acid encodes a cytochrome P450 
polypeptide. 

30 26. A vector comprising an isolated nucleic acid molecule selected from the 

group consisting of the isolated nucleic acid molecule of any one of claims 1 8-21 and 
22-24. 
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27. A host cell comprising an isolated nucleic acid molecule selected from the 
group consisting of the isolated nucleic acid molecule of any one of claims 18-21 and 
22-24. 

5 28. A plant or plant component comprising an isolated nucleic acid molecule 

selected from the group consisting of the isolated nucleic acid molecule of any one of 
claims 18-21 and 22-24. 

29. A process for conferring disease resistance on a plant or plant component, 
10 said method comprising the steps of: 

(a) introducing into a plant cell a transgene comprising a nucleic acid 
molecule selected from the group consisting of the isolated nucleic acid molecule of any 
one of claims 1 8-21 and 22-24 operably linked to a promoter functional in said plant cell 
to yield a transformed plant cell; and 
1 5 (b) regenerating a plant or plant component from said transformed plant cell, 

wherein the nucleic acid molecule is expressed in said plant or plant component, 
thereby conferring disease resistance on said plant or plant component. 

30. A method of producing a cytochrome P450 polypeptide in a host cell, said 
20 method comprising the steps of: 

(a) providing a host cell of claim 27; 

(b) culturing said host cell under conditions for expressing said isolated 
nucleic acid molecule encoding said cytochrome P450 polypeptide; and 

(c) recovering said cytochrome P450 enzyme. 

25 

31. A method for producing an altered compound, said method comprising the 
step of contacting said compound with the isolated polypeptide of any one of claims 1 0- 
15 under conditions allowing for the hydroxylation, oxidation, demethylation, 
methylation, or any combination of said enzymatic activity of said compound and 

30 recovering said altered compound. 

32. A hydroxylating agent comprising the isolated polypeptide of any one of 
claims 10-15. 
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SEQUENCE LISTING 

<110> University of Kentucky Research Foundation 
<12 0> Cytochrome P450s and Uses Thereof 



<130> 07678/100WO3 

<150> US 60/274,241 
<151> 2001-03-09 

<150> US 60/275,597 
<151> 2001-03-13 

<160> 43 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 473 
<212> PRT 

<213> Nicotiana tabacum 
<400> 1 

Met Gin Phe Phe Ser Leu Val Ser lie Phe Leu Phe Leu Ala Phe Leu 

15 10 15 

Phe Leu Leu Arg Lys Trp Lys Asn Ser Asn Ser Gin Ser Lys Lys Leu 

20 25 30 

Pro Pro Gly Pro Trp Lys lie Pro lie Leu Gly Ser Met Leu His Met 

35 40 45 

lie Gly Gly Glu Pro His His Val Leu Arg Asp Leu Ala Lys Lys Tyr 

50 55 60 

Gly Pro Leu Met His Leu Gin Leu Gly Glu lie Ser Ala Val Val Val 
65 70 75 80 

Thr Ser Arg Asp Met Ala Lys Glu Val Leu Lys Thr His Asp Val Val 

85 90 95 

Phe Ala Ser Arg Pro Lys lie Val Ala Met Asp lie lie Cys Tyr Asn 

100 105 110 

Gin Ser Asp lie Ala Phe Ser Pro Tyr Gly Asp His Trp Arg Gin Met 

115 120 125 

Arg Lys lie Cys Val Met Glu Leu Leu Asn Ala Lys Asn Val Arg Ser 

130 135 140 

Phe Ser Ser lie Arg Arg Asp Glu Val Val Arg Leu lie Asp Ser lie 
145 150 155 160 

Arg Ser Asp Ser Ser Ser Gly Glu Leu Val Asn Phe Thr Gin Arg lie 

165 170 175 

lie Trp Phe Ala Ser Ser Met Thr Cys Arg Ser Ala Phe Gly Gin Val 

180 185 190 

Leu Lys Gly Gin Asp lie Phe Ala Lys Lys lie Arg Glu Val lie Gly 
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Asp 
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Gly 
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Mat* 
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Asn 


Pro 
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AT ^ 
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Phe 


Arg 


Asp 


one 








Lys 


Tyr 


Leu 


Lys 


Ser 


Pro 


Leu 


Leu 








340 




Tyr 


Thr 


lie 






355 




L6U 


Gly Arg Asp 




370 






bill 


Arg 


Phe 


Glu 


3 85 








Pile 


Leu 


Pro 


Phe 


Gly 


Leu 


Ala 


Asn 








420 


Asp 


Trp 


Lys 


Leu 






435 




Glu 


Leu 


Ser Gly 




450 






Ala 


Thr 


Pro 


Tyr 



465 



200 

Phe Asp Val Val 
215 

Ser Gly Met Lys 
230 

lie Val Glu Asp 
245 

Lys Ser Asn Gly 

Ser Ser Thr Thr 
280 

Ser Val Phe Thr 
295 

Lys Val Ser Phe 
310 

Leu Val lie Lys 
325 

Val Pro Arg Glu 

Pro Ala Lys Thr 
360 

Pro Lys Tyr Trp 
375 

Gin Cys Ser Val 
390 

Gly Gly Gly Arg 
405 

Leu Tyr Leu Pro 

Pro Thr Gly lie 
440 

He Thr He Ala 
455 

Gin Pro Ser Arg 
470 



Asp He Phe Pro 
220 

Arg Lys Leu Leu 
235 

Val He Asn Glu 
250 

Ala Leu Glu Asp 
265 

Thr Val Trp Ala 

Lys Ala Gin Ala 
300 

Asp Glu Asn Asp 
315 

Glu Thr Leu Arg 
330 

Cys Arg Glu Asp 
345 

Lys Val Met Val 

Asp Asp Ala Glu 
380 

Asp Phe Phe Gly 
395 

Arg He Cys Pro 
410 

Leu Ala Gin Leu 
425 

Met Pro Arg Asp 

Arg Lys Gly Asp 
460 

Glu 



205 

Thr Tyr Lys Phe 

Asn Ala His Leu 
240 

His Lys Lys Asn 
255 

Met Phe Ala Ala 
270 

Met Ala Glu Met 
285 

Glu Val Arg Glu 

Val Glu Glu Leu 
320 

Leu His Pro Pro 
335 

Thr Asp He Asn 
350 

Asn Val Trp Ala 
365 

Ser Phe Lys Pro 

Asn Asn Phe Glu 
400 

Gly Met Ser Phe 
415 

Leu Tyr His Phe 
430 

Leu Asp Leu Thr 
445 

Leu Tyr Leu Asn 



<210> 2 
<211> 1660 
<212> DNA 

<213> Nicotiana tabacum 



<400> 2 

ggatggtcta ataatcctcc 
tttcctcttc ctagctttcc 
caaaaaattg ccaccaggtc 
tggtggagaa ccgcaccatg 
ccttcagtta ggtgaaattt 
gctaaaaact catgaegteg 
ctgttataac cagtccgaca 
taaaatttgt gtcatggaac 



atttatctcc gaaaatgcaa 
tatttttgtt gaggaaatgg 
catggaaaat accaatacta 
tccttagaga tttagccaaa 
ctgcagttgt ggttacttct 
tttttgeate taggectaaa 
ttgectttag cccttatggc 
ttctcaatgc aaagaatgtt 



ttcttcagct tggtttccat 60 
aagaactcca atagecaaag 120 
ggaagtatgc ttcatatgat 180 
aaatatggac cacttatgea 240 
a 999acatgg caaaagaagt 300 
attgtageca tggacattat 360 
gaccactgga gaeaaatgeg 42 0 
eggtctttea gctccatcag 480 
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acgtgatgaa gtcgttcgtc tcattgactc tatccggtca gattcttctt caggtgagct 540 
agttaatttt acgcagagga tcatttggtt tgcaagctcc atgacgtgta gatcagcatt 60 0 
tgggcaagta ctcaaggggc aagacatatt tgccaaaaag atcagagaag taataggatt 660 
agcagaaggc tttgatgtgg tagacatctt ccctacatac aagtttcttc atgttctcag 720 
tgggatgaag cgtaaacttt tgaatgccca ccttaaggta gacgccattg ttgaggatgt 7 80 
catcaacgag cacaagaaaa atcttgcagc tggcaaaagt aatggcgcat tagaggacat 84 0 
gtttgctgcc ggaacagaaa cttcatcaac aacaactgta tgggctatgg ctgaaatgat 900 
gaagaatcca agtgtattca ccaaagctca agcagaagtg cgagaagcct ttagggacaa 960 
agtatctttt gatgaaaatg atgtggagga gctgaaatac ttaaagttag tcattaaaga 102 0 
aactttgaga cttcatccac cgtctccact tttggtccca agagaatgca gggaagatac 1080 
ggatataaac ggctacacta ttcctgcaaa gaccaaagtt atggttaatg tttgggcatt 114 0 
gggaagagat ccaaaatatt gggatgacgc ggaaagcttt aagccagaga gatttgagca 12 00 
atgctctgtg gatttttttg gtaataattt tgagtttctt ccctttggcg gtggacggag 12 60 
aatttgtcct ggaatgtcat ttggtttagc taatctttac ttgccattgg ctcaattact 1320 
ctatcacttt gactggaaac tcccaaccgg aatcatgcca agagacttag acttgaccga 13 80 
attatcggga ataactattg ctagaaaggg tgacctttac ttaaatgcca ctccttatca 144 0 
accttctcga gagtaattca atattggcat aaacatttta aatttccttc atcaacctca 1500 
atattgtaca ataatcattc ttctggtgtt ataggcttta tcgatttcca atacatgtat 1560 
tctttattaa aaaatgtatc acattccatg tagaaggagg acgcaccaat taattgtgcc 162 0 
atgattttag ggtaacttgt tccatcttaa aaaaaaaaaa 1660 

<210> 3 
<211> 474 
<212> PRT 

<213> Nicotiana tabacum 
<400> 3 

Met Glu Phe Phe Ser Leu val Ser lie Phe Leu Phe Leu Ser Phe Leu 

15 10 15 

Phe Leu Leu Arg Lys Cys Lys Asn Ser Asn Ser Gin Thr Lys Gin Leu 

20 25 30 

Pro Pro Gly Pro Trp Lys lie Pro lie Leu Gly Ser Met Leu His Met 

35 40 45 

Leu Gly Gly Glu Pro His His lie Leu Arg Asp Leu Ala Lys Lys Tyr 

50 55 60 

Gly Pro lie Met His Leu Gin Phe Gly Glu lie Ser Ala Val Val Val 
65 70 75 80 

Thr Ser Arg Glu Met Ala Lys Glu Val Leu Lys Thr His Asp Val Val 

85 90 95 

Phe Ala Ser Arg Pro Lys He Val Ala Met Asp He He Cys Tyr Asn 

100 105 110 

Gin Ser Asp He Ala Phe Ser Pro Tyr Gly Asp His Trp Arg Gin Met 

115 120 125 

Arg Lys He Cys Val Met Glu Leu Leu Asn Ala Lys Asn Val Arg Ser 

130 135 140 

Phe Ser Ser He Arg Arg Asp Glu Val Val Arg Leu He Asp Ser He 
145 150 155 160 

Arg Ser Asp Ser Ser Ser Gly Glu Leu Val Asn Phe Thr Gin Arg He 

165 170 175 

He Trp Phe Ala Ser Ser Met Thr Cys Arg Ser Ala Phe Gly Gin Val 
180 185 190 
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Leu Lys Gly Gin Asp Val Phe Ala Lys Lys He Arg Glu Val He Gly 
195 200 205 



Leu. 


Ala 
Aid 


o JLU 




rTie 


Asp 


V a. X 


Al a 
Ala 


Asp 


lie 


rile 


Pro 


Ser 


Tyr 


Lys 


riie 




ZlU 










Z 1 b 










0 0 rt 
z z Q 










Leu 


-MX 9 


\7» 1 
val 


Leu 


Ser 


vjiy 




Lys 


Arg 


Lys 


Leu 


Leu 


Asn 


Ala 

ai a 


Ml s 


Leu 


z z o 










"5 *a ft 
Z J (J 










T -3 C 

23b 










24 0 


Lys 


val 


Asp 


Ala 
Ala 


Tip 

lie 


\7a 1 
val 


nl ii 


Asp 


val 


i ie 


Asn 


pin 

blU 


MX S 


Lys 


Lys 


Asn 










Z 1 b 










2 50 










O r r 

Z 5 5 




Leu 




Thr 


Gl \r 
uxy 


Lys 


Thr 


As n 


Gly 


nxa 


Leu 


ox y 


m 11 


Asp 




xrlltr 


ai a 

jh.x a 








ft 

Z o U 










o *r c 
z ob 










O 1 ^ 
z / U 






Al o 
Aid 




X 11X7 


UlU 


lllX 


Ser 


ser 


inx 


A ill 


1 11X 


\7a 1 
Val 


Trp 


Al 




Al a 
Ala 


bill 






*d *7 c 
z /b 










"> Q ft 

Z o U 










O Q c 
Zoo 








Mp r- 




Lys 


Asn 


Pro 


Asn 


val 


r lie 


Asn 


Lys 


A 1 a 
nld 


Ol n 


Al a 

ai a 


f^l 11 

VJX U 


Val 
vai 


Arg 




n q a 

z yu 










O Q C 

Z 27b 










*> ft ft 
3 OU 










vj J. U 


X 111. 


Phe 


Lys 


Asp 


Lys 


V d X 


X 111 


XT llC 


Asp 


rzl ii 

UlU 


Tip 

X JLC 


Asp 


Al a 
ax a 


f*3l 11 
olU 


m n 

VjIU 












lift 

JlU 










JlD 










""5 O ft 
J Z U 


Leu 


rsi n 


Tyr 


Leu 


Lys 


Leu 


V CI X 


Tl ^ 
X X c 


Lys 


m ii 

UlU 


1 111 


Leu 


Arg 


Leu 


illo 


Pro 










325 










-3 -3 n, 










*3 O C 

Job 




Pro 


Ser 


Pro 


Leu 


Leu 


vai 


Pro 


Arg 


pi,, 

vjxU 


Cys 


Arg 


VjIU 


Asp 


1 nr 


Asp 


ne 








340 










345 










350 






Asn 


vj-Ly 


Tyr 


Thr 


He 


Pro 


Ala 

ai a 


Lys 


inr 


Lys 


\7=» 1 
Val 


rlCL 


\7s» 1 
Val 


Asn 


\7a 1 
val 


Trp 






355 










360 










365 








Ala 

Aia 


Leu 


Gly Arg Asp 


Pro 


Lys 


Tyr 


Trp 


Asp 


Asp 


Ala 

Aia 


LjIU 


Ser 


ir'ne 


Lys 




370 










375 










3 80 










Pro 


VjlU 


Arg 


Phe 


Glu 


uin 


Cys 


Ser 


Val 


Asp 


T3 V» £i 


T-) Vl «=k 

rile 


vjiy 


Asn 


Asn 


rile 


385 










390 










395 










400 


Glu 


Phe 


Leu 


Pro 


Phe 


Gly 


Gly 


Gly 


Arg 


Arg 


He 


Cys 


Pro 


Gly 


Met 


Ser 










405 










410 










415 




Phe 


Gly 


Leu 


Ala 


Asn 


Leu 


Tyr 


Leu 


Pro 


Leu 


Ala 


Gin 


Leu 


Leu 


Tyr 


His 








420 










425 










430 






Phe 


Asp 


Trp 


Lys 


Leu 


Pro 


Ser 


Gly 


Met 


Met 


Pro 


Gly 


Asp 


Leu 


Asp 


Leu 






435 










440 










445 








Thr 


Glu 


Leu 


Ala 


Gly 


He 


Thr 


He 


Ala 


Arg 


Lys 


Gly 


Asp 


Leu 


Tyr 


Leu 




450 










455 










460 










Met 


Ala 


Thr 


Pro 


Tyr 


Gin 


Pro 


Ser 


Arg. 


Glu 















465 470 



<210> 4 
<211> 1614 
<212> DNA 

<213> Nicotiana tabacum 
<400> 4 

ggatggtcta ataatcctcc atttatctcc caaaatggaa ttcttcagct tggtttccat 60 

attcctattc ctatctttcc tctttttgtt aaggaaatgt aagaactcca atagecaaac 120 

caaacaattg cctccaggtc catggaaaat accaatacta ggaagtatgc ttcatatget 180 

tggtggagaa ccacaccata tccttaggga tttagccaaa aaatatggac caattatgea 240 

ccttcagttt ggtgaaattt ctgcagttgt ggttacttct agggagatgg caaaagaagt 3 00 

gctaaaaact catgaegtag tttttgeate taggectaaa attgtggcca tggacattat 360 

ctgttataac cagtctgata tegectttag cccttatggc gatcactgga gaeaaatgeg 420 
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taaaatttgt 
acgtgatgaa 
agttaatttt 
tgggcaagta 
agcagaaggc 
tggaatgaag 
catcaacgag 
gtttgctgcc 
gaagaatcca 
agtaacattt 
aactttgaga 
agatattaac 
gggaagagat 
atgctctgtg 
aatatgtcct 
atatcacttt 
attagctgga 
accttctcgc 
tgtacaagct 
atgtatcata 



gtcatggaac 
gtcgttcgtc 
acgcaaagga 
cttaaggggc 
tttgatgtgg 
cgtaaacttc 
cacaagaaaa 
ggaacagaaa 
aatgtattca 
gatgaaattg 
cttcatccac 
ggctacacta 
ccaaaatatt 
gatttttttg 
ggtatgtcat 
gattggaaac 
ataacaattg 
gaataattta 
ttctgatgtt 
cccatgtaga 



ttcttaatgc 
tcattgactc 
tcatttggtt 
aagacgtatt 
ccgatatctt 
tgaatgccca 
atcttgcaac 
cttcatcaac 
acaaagctca 
atgcagagga 
cgtctccact 
ttcctgcgaa 
gggatgacgc 
gtaataattt 
ttggtttagc 
tcccgagcgg 
ctagaaaggg 
atggcatcag 
tcaggttttg 
agagggacga 



aaagaatgtt 
tattcgatca 
cgcgagctcc 
tgccaaaaag 
cccttcatac 
ccttaaggta 
tgggaaaact 
aacaactgta 
ggcagaagtg 
gctggaatac 
tttggtccca 
gaccaaagtt 
agaaagcttt 
tgagtttctt 
taatctttac 
aatgatgccc 
tgacctttac 
gttttttaat 
ccgatttgta 
ttaattagtt 



cggtctttca 
gattcttctt 
atgacgtgta 
attagagaag 
aagtttcttc 
gatgccattg 
aatggagcat 
tgggctatgg 
agagaaacct 
ttaaagttag 
agagaatgta 
atggttaatg 
aagccagaga 
ccctttggcg 
ttgccattgg 
ggagacttgg 
ttaatggcta 
tccattgtca 
ataaatgtag 
gtaaaaaaaa 



gctcgatcag 
ctggtgagct 
gatcagcatt 
taatagggtt 
atgttctcag 
ttgaggatgt 
taggagacat 
ctgaaatgat 
ttaaagacaa 
ttattaaaga 
gggaagatac 
tttgggcatt 
gatttgagca 
gtggacggag 
ctcaattgct 
acttgactga 
ctccttatca 
acctcactat 
tttttataat 
aaaa 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1614 



<210> 5 
<211> 537 
<212> PRT 

<213> Nicotiana tabacum 



<400> 5 
Met Lys 
1 

Phe Thr 

Ser Met 

Pro Leu 
50 

Pro Gly 
65 

Asn Asp 

Pro lie 

Asn Pro 

Gly Ser 
130 
Gin Asp 
145 

Arg lie 
Ser Asp 



Asn Met 

He Ala 

20 
Pro Phe 

35 

He He 

Pro Thr 

Leu Asn 

Phe Leu 
100 
Glu Leu 
115 

Arg Pro 

Met Val 

Met Thr 

Met Trp 
180 



Ala Lys 
5 

Phe Leu 

Pro Leu 

Asn Phe 

Ala Val 

70 
His Gin 
85 

Leu Lys 

Ala Asp 

Arg Asn 

Phe Thr 
150 
Leu Pro 
165 

Glu Asn 



Leu Leu Asn 

Ser Phe Ala 
25 

Lys Tyr Met 
40 

Leu Tyr Val 
55 

Pro He Phe 

Leu Leu Ala 

Leu Gly Ser 
105 

Gin Val Leu 

120 
val Val Phe 
135 

He Tyr Gly 

Phe Phe Thr 

Glu Met Asp 
185 



Lys Thr 
10 

Lys Leu 

Ser Leu 

Lys Pro 

Gly Asn 

75 
Thr Met 
90 

Lys Asn 

His Thr 

Asp He 

Asp His 
155 
Asn Lys 
170 

Leu Val 



He Phe 

Leu Ser 

He Val 
45 

Gin Asn 
60 

Trp Leu 

Ser Gin 

Leu Ala 

Gin Gly 
125 
Phe Thr 
140 

Trp Arg 
Val Val 
Val Asn 



Cys He Leu 
15 

Ser Tyr Leu 
30 

Pro Leu Leu 

Asn Leu Pro 

Gin Val Gly 
80 

Thr Tyr Gly 
95 

Val Val Ser 
110 

Val Glu Phe 

Gly Asn Gly 

Lys Met Arg 
160 

His Gin Tyr 
175 

Asp Leu Lys 
190 
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Lys Asn Glu Lys 
195 

Gin Leu Met Leu 
210 

Phe Glu Ser Gin 
225 

Ser Glu Arg Ser 

Phe lie Pro Leu 
260 

Lys Asp Leu Gin 
275 

Glu Lys Arg Arg 
290 

Ser Cys Ala lie 
305 

Asn Glu Gin Asn 

lie Glu Thr Thr 
340 

Asn His Pro lie 
355 

Leu Lys Gly Arg 
370 

Leu Leu Ala Thr 
385 

Leu Leu Val Pro 

Thr He Pro Lys 
420 

Asn Asn Pro Ala 
435 

Phe Leu Glu Glu 
450 

Val Asp Phe Arg 
465 

Gly He He Leu 

Val Ser Asn Phe 
500 

Thr Ser Glu Arg 
515 

Thr Val Val Phe 
530 



<210> 6 

<211> 1745 

<212> DNA 

<213> Nicotiana 



Val 


Lys 


Tyr 


Glu 








200 


Tyr Asn 


He 


Met 






± Zj 




Asn 


Asp 


Pro 


Leu 




230 






Arg 


Leu 


Ala 


Gin 


245 








Leu 


Arg 


XT X. \J 


rue 


Thr 
i ii J. 


Arg 


Arg 


Leu 








280 


Lys 


Tip 


Met 


Asp 






Z J D 




Asp 


His 


He 


Tl e 

11C 




310 






Val 


Leu 


Tyr 


Tl e 
1 1 e 


325 








Leu 


Trp 






v ax 


vjjLn 


Gin 


Lys 








360 


Ser 


v a jl 


Thr 


Glu 






O / D 




va j. 


Asn 




1 1 1 X. 




•"ton 






rilS 


Ma4- 


> 

Asn 


Leu 


405 








Glu 


Thr 


Lys 


Val 


Trp 


Trp 


Lys 


Asn 








440 


Asp 


Ser 


Ser 


Thr 






455 




Tyr 


Leu 


Pro 


flit: 




470 






Ala 


Leu 


Pro 


He 


485 








Glu 


Met 


Gin 


Gly 


Gly 


Gly Gin 


Phe 








520 


Lys 


Pro 


He 


Ala 






535 





tabacum 



Glv 


He 


Val 


lie 


Tyr 


Arg Met 


Met 








220 


Phe 


He 


Glu 


Ala 






235 




Ser 


Phe 


Asp 


Ayr 




250 






Leu 


Arg Gly 


Tyr 










Ala 


Phe 


Phe 


Asn 


Glu 


Asn Gly 


Glu 








300 


Asp 


Ala 


Glu 


Met* 






315 




Val 

VAX 


Glu 


Asn 


Tie 
lie 




330 






yiu 


Trp 


Ala 


Tie 
lie 


*a /i c 








lie 


Arg Asp 


Pi,, 


Ser 


Asn 


Leu 


His 








380 


Leu 


Arg 


Leu 


HIS 






395 




vjIu 


Glu 


Ala 


Lys 




410 






vdl 


Val 


Asn 


Ala 
Hid 










Pro 


Asn 


Glu 


Ir lie 


Glu 


Ala 


Ala 


Val 








460 


Gly 


Met 


Gly 


Arg 






475 




Leu 


Gly Leu 


Val 




490 






Pro 


Pro 


Gly 


Val 


505 








Ser 


Leu 


His 


He 



Ala 



Arg Lys Arg Leu 
205 

Phe Asp Ala Lys 

Thr Lys Phe Asn 
240 

Asn Tyr Gly Asp 
255 

Leu Asn Lys Cys 
270 

Asn Tyr Phe Val 
285 

Lys His Lys He 

Lys Gly Glu He 
320 

Asn Val Ala Ala 
335 

Ala Glu Leu Val 
350 

He Ser Thr Val 
365 

Glu Leu Pro Tyr 

Thr Pro He Pro 
400 

Leu Gly Gly Tyr 
415 

Trp Trp Leu Ala 
430 

Arg Pro Glu Arg 
445 

Ala Gly Gly Lys 

Arg Ser Cys Pro 
480 

He Ala Lys Leu 
495 

Glu Lys Val Asp 
510 

Ala Lys His Ser 
525 
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<400> 6 

cctctagcta 

tacaattgca 

tcttaagtac 

taagccccaa 

tcaagttggc 

tatattttta 

tgaccaagtt 

cgacatattt 

aaaaatgagg 

tgatatgtgg 

gaaatatgag 

tcgaatgatg 

aaagtttaat 

tatcccttta 

aaggagactt 

aaatggagaa 

aggagaaata 

tgaaacaact 

tcaacagaag 

aaacctccat 

accaatacct 

tattcctaaa 

gtggaaaaac 

ggcagctgtt 

gagctgcccc 

gtcaaatttt 

agggcagttt 

ataataatat 

tactaagtta 



atgaaaaaca 
tttctttcat 
atgtcactta 
aacaacctcc 
aatgacttga 
ctcaaacttg 
ctacacacac 
actggtaatg 
cgtattatga 
gagaatgaga 
ggaattgtga 
tttgatgcca 
tcagagagaa 
cttagaccat 
gcattcttca 
aagcataaga 
aatgagcaaa 
ctatggtcca 
attagggatg 
gagctgcctt 
ttacttgtac 
gaaactaagg 
ccgaatgaat 
gctggtggca 
ggaatcatcc 
gaaatgcagg 
agcttgcaca 
gcttaagcta 
ctcgataaag 



tggccaaact 
ttgccaagtt 
ttgtcccttt 
cacctggtcc 
accatcaact 
gttcaaaaaa 
aaggggtcga 
gacaagacat 
cgcttccatt 
tggacttagt 
ttaggaaacg 
aatttgagtc 
gcagattagc 
tcttgagagg 
acaattattt 
taagctgtgc 
atgtactcta 
tggaatgggc 
aaatctcaac 
acttgctagc 
cccatatgaa 
tggttgtgaa 
tccggcccga 
aggtagattt 
ttgcactgcc 
gtccaccagg 
ttgcaaaaca 
tccttgtttt 
atttcaatga 



tctcaacaag 
actgtcctcc 
acttcccctt 
aacagcagtc 
ccttgccacc 
cctagctgtg 
gtttgggtcc 
ggtgttcacc 
tttcactaac 
tgttaatgac 
attgcagctg 
ccaaaatgat 
tcagagcttt 
gtaccttaac 
tgtagagaaa 
tattgatcac 
tattgtggag 
catagctgaa 
agtcctcaaa 
aacagtaaat 
ccttgaagaa 
tgcgtggtgg 
gaggtttctt 
caggtactta 
aattctgggg 
tgtggaaaag 
ttccacggtt 
aattatattt 
atattacagt 



accatctttt 
tacctatcta 
ataatcaact 
ccaatatttg 
atgtcacaaa 
gtatcaaacc 
cgtccacgta 
atttatggtg 
aaagtggtgc 
ttgaagaaga 
atgctgtata 
cctttgttca 
gactacaatt 
aagtgtaaag 
agaaggaaaa 
attatagatg 
aatatcaatg 
cttgtaaatc 
ggcagatcag 
gaaacattaa 
gcaaagttag 
ctggctaaca 
gaggaggata 
ccattcggta 
cttgtcatag 
gttgatacaa 
gtcttcaagc 
gtcttaccag 
ttttgttaaa 



gcattctctt 
tgcctttccc 
tcctctatgt 
gtaattggct 
cctacggtcc 
cagagctagc 
acgttgtctt 
accattggcg 
accaatatag 
atgaaaaagt 
acatcatgta 
ttgaggcaac 
atggtgattt 
acttacaaac 
taatggatga 
ccgaaatgaa 
ttgcagcaat 
atcccattgt 
tcacagaatc 
gactccacac 
gtggttacac 
accctgcttg 
gtagcacaga 
tggggaggcg 
ccaaactggt 
gtgaaagagg 
ctattgctgc 
aaagcaaaac 
aaaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1745 



<210> 7 

<211> 534 

<212> PRT 

<213> Nicotiana tabacum 



<400> 7 

Met Ala Lys Leu 
1 

Val Phe Leu Ser 
20 

Phe Pro Leu Glu 
35 

lie Asn Phe Leu 
50 

Thr Ala Val Pro 
65 

Asn His Gin Leu 
Leu Leu Lys Leu 



Leu Asn Asn Thr 
5 

Phe Ala Lys Leu 

Tyr lie Ser Leu 
40 

Cys Val Lys Pro 
55 

lie Phe Gly Asn 
70 

Leu Ala Thr Met 
85 

Gly Ser Lys Asn 



lie Phe Cys lie 
10 

Leu Ser Ser Tyr 
25 

lie Val Leu Leu 

Gin Asn Asn Leu 
60 

Trp Leu Gin Val 
75 

Ser Gin Thr Tyr 
90 

Leu Ala Val Val 



Leu Phe Ser He 
15 

Leu Ser He Pro 
30 

Leu Pro Leu He 
45 

Pro Pro Gly Pro 

Gly Asn Asp Leu 
80 

Gly Pro He Phe 
95 

Ser Asn Pro Glu 
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100 

Leu Ala Asn Gin Val 
115 

Pro Arg Asn Val Val 
130 

Val Phe Thr lie Tyr 
145 

Thr Leu Pro Phe Phe 
165 

Trp Glu Asn Glu Met 
180 

Lys Val Lys Tyr Asp 
195 

Leu Tyr Asn lie Met 
210 

Gin Asp Asp Pro Leu 
225 

Ser Arg Leu Ala Gin 
245 

Leu Leu Arg Pro Phe 
260 

Gin Thr Arg Arg Leu 
275 

Arg Lys lie Met Gly 
290 

He Asp His He He 
305 

Asn Val Leu Tyr He 
325 

Thr Leu Trp Ser Met 
340 

He Val Gin Gin Lys 
355 

Lys Ser Val Lys Glu 
370 

Thr Val Asn Glu Thr 
385 

Pro His Met Asn Leu 
405 

Lys Glu Thr Lys Val 
420 

Ala Trp Trp Lys Asn 
435 

Glu Asp Ser Ser Thr 
450 

Arg Tyr Leu Pro Phe 
465 

Leu Ala Leu Pro He 
485 

Phe Glu Met Gin Ala 
500 



105 



Leu 


His 


Thr 


ij-L n 


Gly Val 






120 








Phe 


Asp 


He 


r*ne 


Thr 


(51 v 




135 










Gly Asp His 


Trp 


Arg 


Lys 


150 












Thr 


Asn Lys 


vai 


Val 


His 










i / u 




Asp 


Leu 


val 


vai 


Asp 


Asp 














Gly 


He 


Val 


ne 


Arg 


Lys 






200 








Tyr Arg 


Met 




XT IlC 


Asp 




215 










Phe 


He 


Glu 


Aia 


1 111 


Lys 


230 










o *> c 


Ser 


Phe 


Asp 


Tyr 


Asn 


Tyr 










2 50 




Leu 


Lys 


Gly 


Tyr 


Leu 


Asn 








2 65 






Ala 


Phe 


Phe 


Asn 


Asn 


Tyr 






280 








Glu Asn Gly 


Glu 


Lys 


His 




295 










Asp 


Ala 


Glu 


Met 


Lys 


Gly 


310 










3 15 


Val 


Glu 


Asn 


He 


Asn 


vai 










33 0 




Glu 


Trp 


Ala 


He 


Aid 


ulu 








*3 A C 






He 


Arg 


Asp 


Glu 


Tip 

lie 


Ser 






360 








Ser 


Asn 


Leu 


His 




Leu 




375 










Leu 


Arg 


Leu 


ril S 


Thr 


Pro 


390 










J " J 


Glu 


Glu 


Ala 


Lys 


I eu 


m v 

oiy 










din 




Val 


Val 


Asn 




Trr> 


Tm 








425 






Gin 


Asn 


Glu 


Phe 


Arg 


Pro 






440 








Glu 


Ala 


Ala 


Val 


Ala 


Gly 




455 










Gly 


Met 


Gly 


Arg 


Arg 


Ser 


470 










475 


Leu 


Gly 


Leu 


Val 


He 


Ala 










490 




Pro 


Pro 


Gly 


Val 


Gly 


Lys 



505 




PCT/US02/06912 



110 

Glu Phe Gly Ser Arg 
125 

Asn Gly Gin Asp Met 
140 

Met Arg Arg He Met 
160 

Gin Tyr Ser Asp Met 
175 

Leu Lys Lys Asn Glu 
190 

Arg Leu Gin Leu Met 
205 

Ala Lys Phe Glu Ser 
220 

Phe Asn Ser Glu Arg 
240 

Gly Asp Phe He Pro 
255 

Lys Cys Lys Asp Leu 
270 

Phe Val Gly Lys Arg 
285 

Lys He Cys Cys Ala 
300 

Glu He Ser Glu Gin 
320 

Ala Ala He Glu Thr 
335 

Leu Val Asn His Pro 
350 

Thr Val Leu Lys Gly 
365 

Pro Tyr Leu Leu Ala 
380 

He Pro Leu Leu Val 
400 

Gly Tyr Thr He Pro 
415 

Leu Ala Asn Asn Pro 
430 

Glu Arg Phe Leu Glu 
445 

Gly Lys Val Asp Phe 
460 

Cys Pro Gly He He * 
480 

Lys Leu Val Ser Asn 
495 

Val Asp Thr Ser Glu 
510 
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Lys Gly Gly Gin Phe Ser Leu His lie Ala Lys His Ser Thr Val Val 

515 520 525 

Phe Lys Pro lie Ala Ala 
530 



<210> 8 
<211> 1693 
<212> DNA 

<213> Nicotiana tabacum 
<400> 8 

cctctagcta attaaaaaca tggccaaact tctcaacaac accatctttt gcattctctt 60 
ttcaattgta tttctttcat ttgccaaatt actatcctcc tacctctcta tacctttccc 120 
tcttgagtac atttcactta ttgtcctttt acttccccta ataatcaact tcctctgtgt 180 
taagccccaa aacaacctcc cacctggtcc aacagcagtc ccaatttttg gtaattggct 240 
tcaagttggc aatgacttga accatcaact ccttgccacc atgtcacaaa cctatggtcc 300 
tatattttta ctcaaacttg gttcaaaaaa cctagctgtg gtatcgaacc ctgagctagc 3 60 
taaccaagtt ctacacacgc aaggggtcga gtttgggtcc cgtccacgta acgttgtctt 420 
tgatatattt actggtaatg gacaagacat ggtgttcacc atttatggtg accattggcg 4 80 
aaaaatgagg cgtattatga cgcttccatt tttcactaac aaagtggtgc accaatatag 540 
tgatatgtgg gagaatgaga tggacctagt tgttgatgac ttgaagaaga atgaaaaagt 600 
gaaatatgac ggaattgtga ttaggaaacg attgcagctg atgctatata acattatgta 660 
tcgaatgatg tttgatgcca agtttgagtc ccaagatgat cctttgttca ttgaggcaac 72 0 
aaagtttaat tcagagagaa gcagattagc tcagagcttt gactacaatt atggtgattt 780 
tatccctttg cttagaccat tcttgaaagg gtaccttaac aagtgcaaag acttacaaac 840 
aaggagactt gcattcttca acaattattt tgtagggaaa agaaggaaaa taatgggtga 900 
aaatggagaa aaacacaaga tatgttgtgc tattgatcac attatagatg ctgaaatgaa 960 
aggagaaata agtgagcaaa atgtactcta tattgtggag aatatcaatg ttgcagcaat 102 0 
tgaaacaact ctatggtcca tggaatgggc catagctgag cttgtaaatc atcccattgt 10 80 
tcaacagaag attagggatg aaatctcaac agtcctcaaa ggaaagtcag tcaaagaatc 114 0 
aaacctacat gagctgcctt acttgctagc aacagtaaat gaaacattaa gactccacac 1200 
accaatacct ttacttgtac cacatatgaa ccttgaagaa gcaaagctag gtggttacac 1260 
tattcctaaa gaaactaagg tggttgtgaa tgcatggtgg ctggctaaca accctgcctg 1320 
gtggaaaaac cagaacgaat tccggcccga gcggtttctc gaggaggata gtagcacaga 13 80 
ggcagctgtt gctggtggca aggttgattt caggtacttg cccttcggta tggggaggcg 1440 
gagctgcccc ggaatcatcc ttgcactgcc aattctgggg cttgtcatag ccaaactggt 1500 
gtcaaatttt gaaatgcagg ctcctccagg tgtaggaaaa gttgatacaa gtgagaaagg 1560 
agggcagttt agcttgcaca ttgcaaaaca ttccacggtt gtcttcaagc ctattgctgc 1620 
ataatattac agtttttgtt actctataaa gatttcaatg aatattacag tttttgttaa 1680 
aaaaaaaaaa aaa 1693 

<210> 9 
<211> 519 
<212> PRT 

<213> Nicotiana tabacum 
<400> 9 

Met Tyr His Leu Leu Ser Pro lie Glu Ala lie Val Gly Leu Val Thr 

15 10 15 

Phe Ala Phe Leu Leu Tyr Leu Leu Trp Thr Lys Lys Gin Ser Lys lie 
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He 
210 


Leu 
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Thr Thr Asn Ala 

435 . 
Pro Phe Gly Ser 
450 

Gin Val Glu His 
465 

Ala Thr Thr Thr 

Thr Leu Pro Lys 
500 

Pro Pro Thr Leu 
515 



Ala lie Asp Tyr 
440 

Gly Arg Arg Ala 
455 

Leu Ser lie Ala 
470 

Asn Glu Pro Leu 
485 

Lys Thr Asp Val 
Tyr Gin Tyr 



Arg Gly Gin His 

Cys Pro Ala Met 
460 

His Leu lie Gin 
475 

Asp Met Lys Gin 
490 

Glu Val Leu He 
505 



Tyr Glu Leu He 
445 

Asn Tyr Ser Leu 

Gly Phe Asn Phe 
480 

Gly Val Gly Leu 
495 

Thr Pro Arg Leu 
510 



<210> 10 
<211> 1578 
<212> DNA 

<213> Nicotiana tabacum 



<400> 10 

atgtatcatc 

ctctacttgc 

ccaggtggat 

cgccattttt 

cggttagggt 

tctaccaatg 

tataacaatg 

ctagtcaatc 

tctatagttc 

aaccttagtg 

aagacctata 

caagctatgg 

cttacaggga 

ggttggttag 

caagatttta 

tctcatgaca 

cttgcacttc 

aaagcacaag 

atcaagaatt 

gcacctttgt 

cctaagggga 

gtagatcctg 

cgcgggcagc 

aattactcat 

gcaactacga 

aagacagatg 

tatgttttgt 



ttctttctcc 
tatggacaaa 
ggccagtaat 
ctcaaaaact 
ttcgccgttt 
atatccattt 
ccatgcttgc 
aagaacttct 
agaaaaatat 
attggataga 
ataatggaca 
agattgagct 
atattaaggc 
atgagcacat 
ttgatgtgct 
ccaccatcaa 
atataaagtg 
aagagatgga 
tggtgtatct 
cagtacaaca 
ctgcactact 
atacattcga 
actatgagtt 
tgcaagtgga 
ctaacgagcc 
ttgaagtgct 
tgttgtga 



catagaagcc 
aaaacaatca 
cggccatctc 
cggagactta 
cttggcggtg 
cgccgatcgg 
tgttgccaaa 
ctccgttagt 
taaacaattg 
taaattgaca 
tggagaaata 
ctatgatgtt 
tatgaaacaa 
taagaagaga 
gctttccaag 
agcaacagta 
ggtaatggcg 
caccattgtt 
tcaagcaatt 
cctatccgta 
tacaaatatt 
tccagaaaga 
gatcccgttt 
acacctttca 
tttggatatg 
aattacacct 



attgtaggac 
aaaatcttaa 
ttttatttca 
gctgacaaat 
agtagttatg 
ccagctttac 
tatggccctt 
cggctcgaaa 
tataattgtg 
ttcgacatca 
ctcaaagcag 
tttcacattc 
actttcaaag 
gaaacaaagg 
aggagcaacg 
ttcactttgg 
ttaatgataa 
ggtagagata 
gttaaagaag 
aaagattgtg 
atgaaacttc 
ttcttgacga 
ggatcaggga 
attgctcatt 
aaacaaggcg 
cgccttcctc 



ttgtaacctt 
acccactgcc 
acaacaatgg 
atggtcccgt 
aagctatgaa 
tttacggaga 
actggaaaaa 
aattcaaaca 
attcaccaat 
ttttgaaaat 
cttttcagaa 
catttttcaa 
acattgataa 
atgttggagg 
aacatcttgg 
tcttggatgc 
acaataagaa 
gatgggtaga 
tattacgatt 
ttgtcaatgg 
aacgagaccc 
ctaatgctgc 
gacgagcttg 
tgatccaggg 
tgggtctaac 
ctacgcttta 



tgcatttcta 
tccaaaaatc 
cgatgatgac 
cttcacattc 
agaatgcttc 
atacctttgc 
aaatcgaaag 
tgttagattt 
ggtgaagata 
ggttgttggg 
gttcatggtt 
gtggttggat 
tattatccaa 
tgaaaatgaa 
cgatggttac 
aacagacaca 
tgtcatgaag 
agagaatgat 
acatccacct 
ataccatatt 
acaaatatgg 
aattgactat 
tcccgcgatg 
tttcaatttt 
tttacctaag 
tcaatattaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1578 



<210> 11 
<211> 509 
<212> PRT 
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<213> Nicotiana tabacum 
<400> 11 

Met Glu Gly Thr Asn Leu Thr Thr Tyr Ala Ala Val Phe Leu Gly Thr 

15 10 15 

Leu Phe Leu Leu Leu Leu Ser Lys Phe Leu Arg Gin Arg Lys Leu Asn 

20 25 30 

Leu Pro Pro Gly Pro Lys Pro Trp Pro lie lie Gly Asn Leu Asn Leu 

35 40 45 

lie Gly Asn Leu Pro His Arg Ser lie His Glu Leu Ser Leu Lys Tyr 

50 55 60 

Gly Pro lie Met Gin Leu Gin Phe Gly Thr Phe Pro Val Val Val Gly 
65 70 75 80 

Ser Ser Val Glu Met Ala Lys Val Phe Leu Lys Ser Met Asp lie Asn 

85 90 95 

Phe Val Gly Arg Pro Lys Thr Ala Ala Gly Lys Tyr Thr Thr Tyr Asn 

100 105 110 

Tyr Ser Asp He Thr Trp Ser Pro Tyr Gly Pro Tyr Trp Arg Gin Ala 

115 120 125 

Arg Arg Met Cys Leu Met Glu Leu Phe Ser Thr Lys Arg Leu Asp Ser 

130 135 140 

Tyr Glu Tyr He Arg Ala Glu Glu Leu His Ser Leu Leu His Asn Leu 
145 150 155 160 

Asn Lys He Ser Gly Lys Pro He Val Leu Lys Asp Tyr Leu Thr Thr 

165 170 175 

Leu Ser Leu Asn Val He Ser Arg Met Val Leu Gly Lys Arg Tyr Leu 

180 185 190 

Asp Glu Ser Glu Asn Ser He Val Thr Pro Glu Glu Phe Lys Lys Met 

195 200 205 

Leu Asp Glu Leu Phe Leu Leu Asn Gly Val Leu Asn He Gly Asp Ser 

210 215 220 

He Pro Trp He Asp Phe Met Asp Leu Gin Gly Tyr Val Lys Arg Met 
225 230 235 240 

Lys Phe Val Ser Lys Lys Phe Asp Lys Phe Leu Glu His Val He Asp 

245 250 255 

Glu His Asn Val Arg Arg Asn Gly Val Glu Asn Tyr He Ala Lys Asp 

260 265 270 

Met Val Asp Val Leu Leu Gin Leu Ala Asp Asp Pro Thr Leu Glu Val 

275 280 285 

Lys Leu Glu Arg His Gly Val Lys Ala Phe Thr Gin Asp Met Leu Ala 

290 295 300 

Gly Gly Thr Glu Ser Ser Ala Val Thr Val Glu Trp Ala He Ser Glu 
305 310 315 320 

Leu Leu Lys Lys Pro Glu He Phe Lys Lys Ala Thr Glu Glu Leu Asp 

325 330 335 

Arg Val He Gly Gin Asn Arg Trp Val Gin Glu Lys Asp He Pro Asn 

340 345 350 

Leu Pro Tyr He Glu Ala He Val Lys Glu Thr Met Arg Leu His Pro 

355 360 365 

Val Ala Pro Met Leu Val Pro Arg Glu Cys Arg Glu Asp Cys Lys Val 
370 375 380 
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Ala Gly Tyr Asp Val Lys Lys Gly Thr Arg Val Leu Val Ser Val Trp 
385 390 395 400 

Thr lie Gly Arg Asp Pro Thr Leu Trp Asp Glu Pro Glu Ala Phe Lys 

405 410 415 

Pro Glu Arg Phe His Glu Lys Ser lie Asp Val Lys Gly His Asp Phe 

420 425 430 

Glu Leu Leu Pro Phe Gly Ala Gly Arg Arg Met Cys Pro Gly Tyr Asn 

435 440 445 

Leu Gly Leu Lys Val lie Gin Ala Ser Leu Ala Asn Leu lie His Gly 

450 455 460 

Phe Asn Trp Ser Leu Pro Asp Asn Met Thr Pro Glu Asp Leu Asp Met 
465 470 475 480 

Asp Glu lie Phe Gly Leu Ser Thr Pro Lys Lys Phe Pro Leu Ala Thr 

485 490 495 

Val lie Glu Pro Arg Leu Ser Pro Lys Leu Tyr Ser Val 
500 505 



<210> 12 
<211> 1530 
<212> DNA 

<213> Nicotiana tabacum 



<400> 12 

atggaaggta 

ctcctttcca 

ccgatcatcg 

tcactcaagt 

tcttccgtcg 

cctaaaacgg 

tatggaccat 

cgtctcgatt 

aataaaatat 

gttattagca 

actcctgagg 

attggagatt 

aaatttgtga 

aggagaaatg 

gctgatgatc 

gatatgcttg 

ctgctaaaga 

cagaatagat 

aaagagacta 

gactgtaagg 

actattggaa 

cacgaaaagt 

agaaggatgt 

cttatacatg 

gatgagattt 

agactttcac 



caaacttgac 
aatttcttcg 
gaaacttaaa 
acgggccaat 
aaatggccaa 
ccgccgggaa 
attggcgcca 
catacgagta 
cagggaaacc 
ggatggtact 
aatttaagaa 
caattccctg 
gcaagaaatt 
gagtggagaa 
cgacgttgga 
ctggtggaac 
agccagagat 
gggtacaaga 
tgcgactgca 
tagcaggcta 
gagaccctac 
ccattgatgt 
gcccgggtta 
gatttaactg 
ttgggctctc 
caaaacttta 



tacatatgca 
ccaaagaaaa 
ccttatcggc 
tatgcaactc 
ggttttcctc 
gtacacaact 
ggcacgtaga 
tattcgggct 
aattgtgctg 
ggggaagagg 
gatgttggac 
gattgatttc 
cgacaagttt 
ttacattgct 
agttaagctg 
cgagagttca 
tttcaaaaag 
aaaagacatt 
ccccgtggca 
cgacgttaag 
attgtgggac 
taaaggacat 
taacttgggg 
gtcattgcct 
cacacctaaa 
ctctgtttga 



gcagtatttc 
ctcaacttac 
aatcttcctc 
caattcggga 
aaatcaatgg 
tacaattatt 
atgtgcctaa 
gaggagttgc 
aaagattatt 
tatttggacg 
gagctgttct 
atggacttac 
ttggagcatg 
aaggacatgg 
gagagacatg 
gcagtgacag 
gctacagaag 
ccaaatcttc 
ccaatgttgg 
aaaggaacca 
gagcctgagg 
gattttgagc 
cttaaggtga 
gataatatga 
aagtttccac 



ttggtactct 
ctccaggccc 
atcgctcaat 
ctttccccgt 
atattaactt 
cagatattac 
tggaattatt 
attctctgct 
tgacgacgtt 
aatccgagaa 
tgctaaatgg 
aaggttatgt 
ttatcgatga 
ttgatgttct 
gagtcaaagc 
tggagtgggc 
aattggatcg 
cttacataga 
tgccacggga 
gggtccttgt 
cgttcaagcc 
ttttgccatt 
ttcaagctag 
ctcctgagga 
ttgctactgt 



gtttcttttg 
aaaaccatgg 
ccacgaactt 
tgttgttggc 
tgtaggcagg 
atggtctcct 
cagcacgaaa 
ccataatttg 
gagtttaaat 
ctcgattgtg 
tgtacttaat 
taagaggatg 
gcataacgtt 
gttgcagctt 
attcactcag 
aatttcggag 
agtaattggg 
ggcaatagtc 
gtgtcgagaa 
gagcgtatgg 
ggagaggttc 
tggagctggg 
cttagctaat 
cctcgacatg 
gattgagcca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1530 
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<210> 13 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> derived from Nicotiana tabacutn p45 0 gene 
<400> 13 

ggcggagaat ttgtcctgga atgtcatttg gtttag 

<210> 14 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 gene 
<400> 14 

gtacaatagt gaggttgaca atg 

<210> 15 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 gene 
<400> 15 

ggtggttgtg aatgcatg 

<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 gene 
<400> 16 

ttatgcagca ataggcttga agaca 

<210> 17 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 gene 
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<400> 17 

gggggatcca tgcaattctt cagcttggtt tec 



33 



<210> 18 
<211> 33 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> derived from Nicotiana tabacum p450 gene 
<400> 18 

ggggaattct tactctcgag aaggttgata agg 3 3 

<210> 19 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 gene 



<210> 20 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p4 50 gene 
<400> 20 

ggggaattct caatattgat aaagegtagg agg 33 

<210> 21 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 gene 



<400> 19 

cccggatcca tgtatcatct tctttctccc 



30 



<400> 21 

cccggatcca tgeaatcett cagcttggtt tec 



33 



<210> 22 
<211> 33 
<212> DNA 



<213> Artificial Sequence 
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<220> 

<223> derived from Nicotiana tabacum p450 gene 



<400> 22 

ggggagctct cactcgcaag aagattgata agg 



33 



<210> 23 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 gene 
<400> 23 

gccattatcg gcgcaatact aatctccaaa ctccgcggta aaaaattcaa gctcccacct 60 
ggtccaacag cagtc 75 

<210> 24 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> derived from Nicotiana tabacum p450 gene 
<400> 24 

gggggatcca tggacctcct cctcatagaa aaaaccctcg tcgccttatt cgccgccatt 60 
atcggcgcaa tacta 75 

<210> 25 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 gene 



<210> 26 

<211> 7 

<212> PRT 

<213> Nicotiana tabacum p450 protein 
<220> 

<221> VARIANT 

<222> 7 

<223> Xaa = Any Amino Acid 



<400> 25 

ggggagctct tatgcagcaa taggcttgaa gac 



33 
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<400> 26 

Lys Glu Thr Leu Arg Leu Xaa 
1 5 



<210> 27 
<211> 11 
<212> PRT 

<213> Nicotiana tabacum p4 50 protein 
<220> 

<221> VARIANT 
<222> 4,8 

<223> Xaa = Any Amino Acid 
<400> 27 

Pro Phe Gly Xaa Gly Arg Arg Xaa Cys Pro Ala 
15 10 



<210> 28 
<211> 11 
<212> PRT 

<213> Nicotiana tabacum p450 protein 
<220> 

<221> VARIANT 
<222> 4 , 8 

<223> Xaa = Any Amino Acid 
<400> 28 

Pro Phe Gly Xaa Gly Arg Arg Xaa Cys Pro Gly 
15 10 



<210> 29 
<211> 6 
<212> PRT 

<213> Nicotiana tabacum p450 protein 
<220> 

<221> VARIANT 
<222> 2 

<223> Xaa = Any Amino Acid 
<400> 29 

Phe Xaa Pro Glu Arg Phe 
1 5 



<210> 30 
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<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 protein 
<400> 30 

Ala Ala Arg Gly Ala Arg Ala Cys lie Tyr Thr lie Met Gly lie Tyr 

1 5 10 15 

Thr lie Cys Ala 
20 



<210> 31 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 protein 
<400> 31 

Ala Ala Arg Gly Ala Arg Ala Cys lie Tyr Thr He Met Gly He Tyr 

15 10 15 

Thr He Thr Ala 
20 



<210> 32 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 protein 
<400> 32 

Ala Ala Arg Gly Ala Arg Ala Cys He Tyr Thr He Met Gly He Tyr 

15 10 15 

Thr He Met Gly 
20 



<210> 33 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 protein 
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<400> 33 

Thr Thr Tyr lie lie lie Cys Cys lie Gly Ala Arg Met Gly lie Thr 

15 10 15 

Thr Tyr 



<210> 34 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 protein 
<400> 34 

Arg Ala Ala lie Cys Lys Tyr Thr Cys lie Gly Gly lie lie lie Arg 

15 10 15 

Ala Ala 



<210> 35 
<211> 20 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> derived from Nicotiana tabacum p450 protein 
<400> 35 

Gly Gly He Met Gly He Met Gly He He He He Thr Gly Tyr Cys 

15 10 15 

Cys He Gly Ser 
20 



<210> 36 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> derived from Nicotiana tabacum p450 protein 
<400> 36 

Cys Lys He Cys Lys He Cys Cys He He He He Cys Cys Arg Ala 

15 10 15 

Ala He Gly Gly 
20 
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<210> 37 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> derived from T7 Bacteriophage Promoter 
<400> 37 

gtaatacgac tcactatagg g 21 

<210> 38 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> derived from T3 Bacteriophage Promoter 
<400> 38 

caattaaccc tcactaaagg g 21 

<210> 39 
<211> 500 
<212> PRT 

<213> Mentha piperita 



<400> 39 
Met Glu Leu Gin 
1 

Thr lie Ser Leu 
20 

Glu Asn Leu Pro 
35 

His Leu Leu Trp 
50 

Lys Gin Tyr Gly 
65 

Val Val Leu Ser 

Asp Pro Ala Cys 
100 

Trp Tyr Asp Asn 
115 

Arg Gin Met Arg 
130 

Val Arg Ser Phe 
145 

Gly His Leu Arg 
Glu Arg lie Ala 



lie Ser Ser Ala 
5 

Leu lie lie Lys 

Pro Gly Pro Pro 
40 

Gly Lys Leu Pro 
55 

Pro Val Ala His 
70 

Ser Arg Glu Ala 
85 

Ala Asp Arg Phe 

Asp Asp lie lie 
120 

Lys lie Cys Val 
135 

Gly Phe lie Arg 
150 

Ser Ser Ala Ala 
165 

Thr Leu Thr Cys 



He He He Leu 
10 

Gin Trp Arg Lys 
25 

Lys Leu Pro Leu 

Gin His Ala Leu 
60 

Val Gin Leu Gly 
75 

Thr Lys Glu Ala 
90 

Glu Ser He Gly 
105 

Phe Ser Pro Tyr 

Ser Glu Leu Leu 
140 

Gin Asp Glu Val 
155 

Ala Gly Glu Ala 
170 

Ser He He Cys 



Val val Thr Tyr 
15 

Pro Lys Pro Gin 
30 

He Gly His Leu 
45 

Ala Ser Val Ala 

Glu Val Phe Ser 
80 

Met Lys Leu Val 
95 

Thr Lys He Met 
110 

Ser Val His Trp 
125 

Ser Ala Arg Asn 

Ser Arg Leu Leu 
160 

Val Asp Leu Thr 
175 

Arg Ala Ala Phe 
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<210> 40 
<211> 496 
<212> PRT 

<213> Mentha spicata 
<400> 40 

Met Glu Leu Asp Leu Leu Ser Ala He He lie Leu Val Ala Thr Tyr 
1 5 10 15 
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He Val Ser 

Asn Leu Pro 
35 

Phe Leu Trp 
50 

Lys Tyr Gly 
65 

Val Leu Ser 

Pro Asn Phe 

Tyr Asp Lys 
115 

Gin Met Arg 
130 

Arg Ser Phe 
145 

Leu Leu Gly 

Ser Lys Met 

Leu Lys Asp 
195 

Leu Ala Ser 
210 

Leu Asn Leu 
225 

Arg Leu Asp 

Lys Ser Gly 

Met Gin Lys 
275 

Lys Gly Phe 
290 

Thr Thr He 
305 

Met Ala Lys 

Val Val Asp 

Leu Lys Glu 
355 

Gin Ser Arg 

370 
Thr Arg He 
385 

Trp Glu Asp 
Arg Asp Phe 



Leu Leu He 
20 

Pro Ser Pro 

Gly Gly Leu 

Pro Val Ala 
70 

Ser Ala Glu 

85 

Ala Asp Arg 
100 

Asp Asp He 

Arg He Cys 

Gly Tyr He 
150 

Ser Ser Gly 
165 

Ser Cys Val 
180 

Gin Gly Ser 

Gly Phe Glu 

Leu Ser Leu 
230 

His He Leu 
245 

Glu Phe Gly 
260 

Gly Ser Asp 

He Phe Asp 

Ser Trp Ala 
310 

Val Gin Ala 

325 
Leu Ser Glu 
340 

Thr Leu Arg 

Glu Glu Cys 

Phe He Asn 
390 

Pro Asp Thr 

405 
Met Gly Asn 



Asn Gin Trp 
25 

Pro Lys Leu 
40 

Pro Gin His 
55 

His Val* Gin 

Ala Ala Lys 

Phe Asp Gly 
105 

He Phe Ser 
120 

Val Thr Glu 
135 

Arg Gin Glu 

Gly Ala Pro 

Val Val Cys 
185 

Leu Ala Glu 

200 
Leu Ala Asp 
215 

Asn Lys Tyr 

Asp Gly Phe 

Gly Glu Asp 
265 

He Lys He 

280 
Thr Phe Ser 
295 

Leu Ser Glu 

Glu Val Arg 

Val Gin Glu 
345 

Leu His Pro 
360 

Glu Val Asn 
375 

Val Trp Ala 
Phe Arg Pro 
Asp Phe Glu 



Arg Lys Ser 

Pro Val He 

Val Phe Arg 
60 

Leu Gly Glu 
75 

Gin Ala Met 
90 

He Gly Ser 

Pro Tyr Asn 

Leu Leu Ser 
140 

Glu lie Glu 

155 
Val Asp Val 
170 

Arg Ala Ala 

Leu Val Lys 

Leu Tyr Pro 
220 

Arg Leu Gin 

235 
Leu Glu Glu 
250 

lie Val Asp 

Pro lie Thr 

Ala Gly Ala 
300 

Leu Met Arg 
315 

Glu Ala Leu 
330 

Leu Lys Tyr 

Pro Phe Pro 

Gly Tyr Thr 
380 

lie Gly Arg 

395 
Glu Arg Phe 
410 

Phe lie Pro 



22- 



Lys Ser Gin Gin 
30 

Gly His Leu His 
45 

Ser lie Ala Gin 

Val Tyr Ser Val 
80 

Lys Val Leu Asp 
95 

Arg Thr Met Trp 
110 

Asp His Trp Arg 
125 

Pro Lys Asn Val 

Arg Leu lie Arg 
160 

Thr Glu Glu Val 
175 

Phe Gly Ser Val 
190 

Glu Ser Leu Ala 
205 

Ser Ser Trp Leu 

Arg Met Arg Arg 
240 

His Arg Glu Lys 
255 

Val Leu Phe Arg 
270 

Ser Asn Cys He 
285 

Glu Thr Ser Ser 

Asn Pro Ala Lys 
320 

Lys Gly Lys Thr 
335 

Leu Arg Ser Val 
350 

Leu lie Pro Arg 
365 

He Pro Ala Lys 

Asp Pro Gin Tyr 
400 

Asp Glu Val Ser 
415 

Phe Gly Ala Gly 
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420 

Arg Arg lie Cys 
435 

Pro Leu Ala Gin 
450 

Met Thr Asp Ala 
465 

Pro Lys Lys Lys 



Pro Gly Leu His 
440 

Leu Leu Tyr His 
455 

Asp Leu Asp Met 
470 

Asn Val Cys Leu 
485 



425 

Phe Gly Leu Ala 

Phe Asp Trp Lys 
460 

Thr Glu Thr Pro 
475 

Val Pro Thr Leu 
490 



430 

Asn Val Glu lie 
445 

Leu Pro Gin Gly 

Gly Leu Ser Gly 
480 

Tyr Lys Ser Pro 
495 



<210> 41 
<211> 509 
<212> PRT 

<213> Nepeta racemosa 
<400> 41 

Met Val Ser Leu Ser Tyr Phe Leu lie Ala Leu Leu Cys Thr Leu Pro 

15 10 15 

Phe Leu Leu Phe Leu Asn Lys Trp Arg Arg Ser Tyr Ser Gly Lys Thr 

20 25 30 

Pro Pro Pro Ser Pro Pro Lys Leu Pro Val lie Gly Asn Leu His Gin 

35 40 45 

Leu Gly Leu Tyr Pro His Arg Tyr Leu Gin Ser Leu Ser Arg Arg Tyr 

50 55 60 

Gly Pro Leu Met Gin Leu His Phe Gly Ser Val Pro Val Leu Val Ala 
65 70 75 80 

Ser Ser Pro Glu Ala Ala Arg Glu lie Met Lys Asn Gin Asp lie Val 

85 90 95 

Phe Ser Asn Arg Pro Lys Met Ser lie Ala Asn Arg Leu Phe Phe Asn 

100 105 110 

Asn Arg Asp Val Ala Phe Thr Gin Tyr Gly Glu Tyr Trp Arg Gin lie 

115 120 125 

Arg Ser lie Cys Val Leu Gin Leu Leu Ser Asn Lys Arg Val Gin Ser 

130 135 140 

Phe Arg Arg Val Arg Glu Glu Glu Thr Ser lie Met Val Glu Lys lie 
145 150 155 160 

Met Gin Leu Gly Ser Ser Ser Ser Thr Pro Val Asn Leu Ser Glu Leu 

165 170 175 

Leu Leu Ser Leu Thr Asn Asp Val Val Cys Arg Val Thr Leu Gly Lys 

180 185 190 

Lys Tyr Gly Gly Gly Asn Gly Ser Glu Glu Val Asp Lys Leu Lys Glu 

195 200 205 

Met Leu Thr Glu He Gin Asn Leu Met Gly He Ser Pro Val Trp Glu 

210 215 220 

Phe He Pro Trp Leu Asn Trp Thr Arg Arg Phe Asp Gly Val Asp Gin 
225 230 235 240 

Arg Val Asp Arg He Val Lys Ala Phe Asp Gly Phe Leu Glu Ser val 

245 250 255 

lie Gin Glu His Lys Glu Arg Asp Gly Asp Lys Asp Gly Asp Gly Asp 
260 265 270 
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Gly Ala Leu Asp Phe Val Asp lie Leu Leu Gin Phe Gin Arg Glu Asn 

275 280 285 

Lys Asn Arg Ser Pro Val Glu Asp Asp Thr Val Lys Ala Leu lie Leu 

290 295 300 

Asp Met Phe Val Ala Gly Thr Asp Thr Thr Ala Thr Ala Leu Glu Trp 
305 310 315 320 

Ala Val Ala Glu Leu lie Lys Asn Pro Arg Ala Met Lys Arg Leu Gin 

325 330 335 

Asn Glu Val Arg Glu Val Ala Gly Ser Lys Ala Glu lie Glu Glu Glu 

340 345 350 

Asp Leu Glu Lys Met Pro Tyr Leu Lys Ala Ser lie Lys Glu Ser Leu 

355 360 365 

Arg Leu His Val Pro Val Val Leu Leu Val Pro Arg Glu Ser Thr Arg 

370 375 380 

Asp Thr Asn Val Leu Gly Tyr Asp lie Ala Ser Gly Thr Arg Val Leu 
385 390 395 400 

lie Asn Ala Trp Ala lie Ala Arg Asp Pro Ser Val Trp Glu Asn Pro 

405 410 415 

Glu Glu Phe Leu Pro Glu Arg Phe Leu Asp Ser Ser lie Asp Tyr Lys 

420 425 430 

Gly Leu His Phe Glu Leu Leu Pro Phe Gly Ala Gly Arg Arg Gly Cys 

435 440 445 

Pro Gly Ala Thr Phe Ala Val Ala lie Asp Glu Leu Ala Leu Ala Lys 

450 455 460 

Leu Val His Lys Phe Asp Phe Gly Leu Pro Asn Gly Ala Arg Met Glu 
465 470 475 480 

Glu Leu Asp Met Ser Glu Thr Ser Gly Met Thr Val His Lys Lys Ser 

485 490 495 

Pro Leu Leu Leu Leu Pro lie Pro His His Ala Ala Pro 
500 505 



<210> 42 
<211> 471 
<212> PRT 

<213> Persea americana 
<400> 42 

Met Ala lie Leu Val Ser Leu Leu Phe Leu Ala lie Ala Leu Thr Phe 

15 10 15 

Phe Leu Leu Lys Leu Asn Glu Lys Arg Glu Lys Lys Pro Asn Leu Pro 

20 25 30 

Pro Ser Pro Pro Asn Leu Pro lie lie Gly Asn Leu His Gin Leu Gly 

35 40 45 

Asn Leu Pro His Arg Ser Leu Arg Ser Leu Ala Asn Glu Leu Gly Pro 

50 55 60 

Leu lie Leu Leu His Leu Gly His lie Pro Thr Leu lie Val Ser Thr 
65 70 75 80 

Ala Glu lie Ala Glu Glu lie Leu Lys Thr His Asp Leu lie Phe Ala 

85 90 95 

Ser Arg Pro Ser Thr Thr Ala Ala Arg Arg lie Phe Tyr Asp Cys Thr 
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100 



105 



110 



Asp Val Ala Phe Ser Pro Tyr Gly Glu Tyr Trp Arg Gin Val Arg Lys 

115 120 125 

lie Cys Val Leu Glu Leu Leu Ser lie Lys Arg Val Asn Ser Tyr Arg 

130 135 140 

Ser lie Arg Glu Glu Glu Val Gly Leu Met Met Glu Arg lie Ser Gin 
145 150 155 160 

Ser Cys Ser Thr Gly Glu Ala Val Asn Leu Ser Glu Leu Leu Leu Leu 

165 170 175 

Leu Ser Ser Gly Thr lie Thr Arg Val Ala Phe Gly Lys Lys Tyr Glu 

180 185 190 

Gly Glu Glu Glu Arg Lys Asn Lys Phe Ala Asp Leu Ala Thr Glu Leu 

195 200 205 

Thr Thr Leu Met Gly Ala Phe Phe Val Gly Asp Tyr Phe Pro Ser Phe 

210 215 220 

Ala Trp Val Asp Val Leu Thr Gly Met Asp Ala Arg Leu Lys Arg Asn 
225 230 235 240 

His Gly Glu Leu Asp Ala Phe Val Asp His Val lie Asp Asp His Leu 

245 250 255 

Leu Ser Arg Lys Ala Asn Gly Ser Asp Gly Val Glu Gin Lys Asp Leu 

260 265 270 

Val Asp Val Leu Leu His Leu Gin Lys Asp Ser Ser Leu Gly Val His 

275 280 285 

Leu Asn Arg Asn Asn Leu Lys Ala Val lie Leu Asp Met Phe Ser Gly 

290 295 300 

Gly Thr Asp Thr Thr Ala Val Thr Leu Glu Trp Ala Met Ala Glu Leu 
305 310 315 320 

lie Lys His Pro Asp Val Met Glu Lys Ala Gin Gin Glu Val Arg Arg 

325 330 335 

Val Val Gly Lys Lys Ala Lys Val Glu Glu Glu Asp Leu His Gin Leu 

340 345 350 

His Tyr Leu Lys Leu lie lie Lys Glu Thr Leu Arg Leu His Pro Val 

355 360 365 

Ala Pro Leu Leu Val Pro Arg Glu Ser Thr Arg Asp Val Val lie Arg 

370 375 380 

Gly Tyr His lie Pro Ala Lys Thr Arg Val Phe lie Asn Ala Trp Ala 
385 390 395 400 

lie Gly Arg Asp Pro Lys Ser Trp Glu Asn Ala Glu Glu Phe Leu Pro 

405 410 415 

Glu Arg Phe Val Asn Asn Ser Val Asp Phe Lys Gly Gin Asp Phe Gin 

420 425 430 

Leu lie Pro Phe Gly Ala Gly Arg Arg Gly Cys Pro Gly lie Ala Phe 

435 440 445 

Gly lie Ser Ser Val Glu lie Ser Leu Ala Asn Leu Leu Tyr Trp Phe 

450 455 460 

Asn Trp Glu Leu Pro Gly He 
465 470 



<210> 43 
<211> 509 
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<212> PRT 

<213> Arabidopsis thaliana 



<400> 43 
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